Question
Why is mapper output usually stored on local disk rather than HDFS?
Correct Answer: B. It is temporary data that can be regenerated if the mapper fails
Explanation:
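Mapper output is intermediate data: reducers fetch it during the shuffle, and it is discarded once the job completes, so it does not need the durability HDFS provides.
Writing it to HDFS would replicate every block across the cluster (three copies by default), so re-running a failed map task is usually cheaper than paying that replication cost for every task.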
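The cost trade-off can be illustrated with a rough back-of-the-envelope sketch. The function names, the 1 GB output size, and the 5% failure rate below are hypothetical values chosen for illustration, not measurements; only the replication factor of 3 reflects the HDFS default. The model counts bytes written and ignores network transfer and CPU time.

```python
def hdfs_bytes_written(map_output_bytes: int, replication: int = 3) -> int:
    """Bytes written cluster-wide if map output went to HDFS.

    Every block is stored `replication` times (3 is the HDFS default).
    """
    return map_output_bytes * replication


def local_expected_bytes_written(map_output_bytes: int, failure_rate: float) -> float:
    """Expected bytes written if map output stays on local disk.

    Simplifying assumption: a lost output is regenerated by re-running
    the map task once, with probability `failure_rate`.
    """
    return map_output_bytes * (1 + failure_rate)


# Hypothetical numbers for illustration only.
map_output = 1_000_000_000   # 1 GB of intermediate output
failure_rate = 0.05          # assume 5% of map outputs are lost

hdfs_cost = hdfs_bytes_written(map_output)
local_cost = local_expected_bytes_written(map_output, failure_rate)

print(f"HDFS (3 replicas): {hdfs_cost / 1e9:.2f} GB written")
print(f"Local disk + occasional re-run: {local_cost / 1e9:.2f} GB written")
```

Under these assumptions, local storage writes far less data even after accounting for re-execution, and the gap only closes if failures become very frequent.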