In my reducer, I require the total number of "lines" of input that were processed by the mappers.
sample input:
- line,1,of,input
- line,2,of,input
- line,3,of,input
So, in all of the Reducers, I need to have access to the whatever was emitted by the Mappers plus the total number of lines (in this case 3).
I'm assuming that I will need either multiple jobs or chain together some mappers and/or reducers but I'm unsure of the proper way.
Note: This is not a simple average program, so I can't just have a single key from the mapper.