Question
What is a disadvantage of long MapReduce job chains?
Correct Answer: D. Each stage may materialize intermediate data to disk and add scheduling overhead
Explanation:
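In a chain of MapReduce jobs, each job typically writes its output to disk (usually a distributed file system such as HDFS) and the next job reads it back. Each job also pays its own startup and scheduling cost, so total latency grows with the length of the chain.
In-memory engines such as Spark can keep intermediate data in memory between stages, which is often faster for iterative pipelines.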
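To make the overhead concrete, here is a rough back-of-the-envelope sketch in Python. The function names and all timing numbers are hypothetical, chosen only for illustration; real costs depend on the cluster, data size, and engine. It assumes a simple model in which a chained pipeline pays startup and intermediate disk I/O on every stage, while an in-memory pipeline pays startup once and keeps intermediates in memory.

```python
def chain_latency(num_stages, compute_s, startup_s, io_s):
    """Seconds for a job chain where every stage pays startup and disk I/O."""
    return num_stages * (compute_s + startup_s + io_s)


def in_memory_latency(num_stages, compute_s, startup_s):
    """Seconds for a pipeline that starts once and keeps intermediates in memory."""
    return startup_s + num_stages * compute_s


if __name__ == "__main__":
    # Hypothetical numbers: 10 stages, 30 s compute, 15 s startup, 20 s disk I/O per stage.
    stages = 10
    chained = chain_latency(stages, compute_s=30, startup_s=15, io_s=20)
    pipelined = in_memory_latency(stages, compute_s=30, startup_s=15)
    print(f"chained jobs: {chained} s, in-memory pipeline: {pipelined} s")
```

Under these assumed numbers the per-stage overhead (startup plus I/O) exceeds the compute time itself, which is why the penalty becomes more noticeable as the chain gets longer.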