I tried a Hive process, which generate words frequency rank from sentences, I would like to output not multiple files but one file.
I searched the similar question this web site, I found mapred.reduce.tasks=1, but it didn't generate one file but 50 files.
The process l tried has 50 input files and they are all gzip file.
How do I get one merged file? 50 input files size is so large that I suppose the reason may be some kind of limit.