I'm using insert into
command to load data from txt table to RC table.
The destination table (RC) is partitioned table, therefore, the dynamic partition is enabled. At the end of the insert into
command I have multiple small files in each partition... I tried to set few Hive parameters for merge, but the result is more or less the same...
The only thing that worked for me is when I added order by [any column]
to insert into
command In that case, there is a reduce process which cause to have eventually a single file in each partition.
This is like ugly workaround, I'm looking for a more elegant way.
Any suggestions?
Thanks