Question
What does distinct do on a Spark RDD?
Correct Answer: D. Removes duplicate elements from an RDD
Explanation:
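To determine uniqueness, Spark must bring equal elements together, and copies of the same element may start out in different partitions.
Therefore distinct commonly requires a shuffle, which moves data between partitions so that all copies of an element end up in the same one.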
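The reasoning above can be sketched in plain Python, without Spark. This is only a simulation under stated assumptions: partitions are modeled as lists, the shuffle is modeled as routing each element by hash to an output partition, and the helper name distinct_partitions is hypothetical (it is not part of the Spark API).

```python
def distinct_partitions(partitions, num_out):
    """Simulate distinct over a list of partitions.

    Hypothetical helper for illustration only; not a Spark function.
    """
    # Map side: drop duplicates inside each input partition first,
    # which reduces the amount of data that has to be moved.
    local = [set(p) for p in partitions]

    # Shuffle: route every element to an output partition chosen by its
    # hash, so equal elements always land in the same output partition.
    shuffled = [set() for _ in range(num_out)]
    for part in local:
        for x in part:
            shuffled[hash(x) % num_out].add(x)

    # Each output partition now holds every copy's single survivor.
    return [sorted(s) for s in shuffled]


# The values 1, 3, and 4 each appear in more than one input partition,
# so no partition can decide uniqueness on its own.
parts = [[1, 2, 2, 3], [3, 4, 1], [4, 4, 5]]
result = distinct_partitions(parts, 2)
flat = sorted(x for p in result for x in p)
print(flat)
```

Deduplicating within each partition alone would still leave 1, 3, and 4 repeated across partitions; only after the hash-based routing step is each value guaranteed to appear once.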