Question
What is the main cost of a reduce-side join?
Select an option. Your answer will be checked instantly.
Correct Answer: A. Both datasets may be shuffled across the network
Explanation:
Reduce-side joins are general but communication-heavy.
Partitioning and skew strongly affect their performance.
Leave a Reply