I have multiple small parquet
files generated as output of hive ql job, i would like to merge the output files to single parquet file?
what is the best way to do it using some hdfs or linux commands
?
we used to merge the text files using cat
command, but will this work for parquet as well?
Can we do it using HiveQL
itself when writing output files like how we do it using repartition
or coalesc
method in spark
?