I have an mrjob that consists of 3 steps. The second step expects as input the results of the first step plus some more content from S3.
I understand that I can always "stream" it through the first step, meaning emit is as is, and only use it in the second step, but I would like to avoid this.
Is there a way to define additional input to later steps in mrjob?