I have a directory in HDFS that contains many files and subdirectories, and I want to compress it and export it from HDFS to the local filesystem.
I came across this question, "Hadoop: compress file in HDFS?", but it seems to apply only to individual files; using hadoop-streaming with the GzipCodec did not work for directories.
What is the most efficient way to compress an HDFS folder into a single gzip file?
Thanks in advance.