0

Apache Beam's Reshuffle was marked as deprecated in May 2017 with the note

For internal use only; no backwards compatibility guarantees.

In addition, the DataflowRunner installs a ReshuffleOverrideFactory which I'm unclear of how changes the reshuffling.

Anyway, the JavaDoc doesn't mention what to use instead. How are users supposed do deal with ParDo transforms with high fan out in general and on Dataflow?

gogstad
  • 3,607
  • 1
  • 29
  • 32

1 Answers1

1

You can look at withFanout option in GroupByKey and Combine operation. Here is the link to the Java API - https://beam.apache.org/releases/javadoc/2.0.0/org/apache/beam/sdk/transforms/Combine.Globally.html#withFanout-int-

Jayadeep Jayaraman
  • 2,747
  • 3
  • 15
  • 26