0

It's a well known optimization to replace GroupByKey with ReduceByKey, since the latter reduces shuffling. I was wondering if there are reverse cases in which code with GroupByKey is faster than with ReduceByKey.

Joel
  • 1,564
  • 7
  • 12
  • 20
alexgbelov
  • 3,032
  • 4
  • 28
  • 42

0 Answers0