We have been using ZipInputStream to read zip files stored in HDFS. Now we also have password-protected zip files. Is there any way to decrypt the zip files directly in HDFS before passing them to ZipInputStream? There is a Java library, zip4j, but it works with the local file system only. Since we have a huge number of files, we want to decrypt and open them in HDFS itself. Is there any way to achieve that?
-
this may help http://stackoverflow.com/questions/36180597/how-to-unzip-files-stored-in-hdfs-using-java-without-first-copying-to-the-local – Tahir Hussain Mir Dec 25 '16 at 14:39
-
and this http://blog.matthewrathbone.com/2013/12/28/Reading-data-from-HDFS-even-if-it-is-compressed.html – Tahir Hussain Mir Dec 25 '16 at 14:41
-
@TahirHussainMir Thanks, but I had already tried that. It works fine with normal zip files; the problem is that I have password-protected zip files. – mahan07 Dec 26 '16 at 04:46
-
The content of this post might help you, have a look: http://stackoverflow.com/questions/19738929/processing-password-protected-zip-files-using-mapreduce – BruceWayne Dec 30 '16 at 04:42
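
One possible approach, sketched under assumptions rather than tested on a cluster: zip4j 2.x (released after this question was asked) ships its own `net.lingala.zip4j.io.inputstream.ZipInputStream`, which accepts any `InputStream` plus a password. Since Hadoop's `FileSystem.open` returns an `InputStream`, the two can be chained so the archive is never copied to the local file system. The class name `HdfsEncryptedZipReader`, the helper names, and the choice to buffer each entry in memory are illustrative assumptions, not part of either library.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.LinkedHashMap;
import java.util.Map;

import net.lingala.zip4j.io.inputstream.ZipInputStream;
import net.lingala.zip4j.model.LocalFileHeader;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

// Hypothetical helper class; assumes zip4j 2.x and the Hadoop client on the classpath.
class HdfsEncryptedZipReader {

    // Reads every file entry of a password-protected zip from an arbitrary stream.
    // Entries are buffered in memory for simplicity; large entries should be
    // processed while streaming instead.
    static Map<String, byte[]> readEntries(InputStream raw, char[] password) throws IOException {
        Map<String, byte[]> entries = new LinkedHashMap<>();
        try (ZipInputStream zin = new ZipInputStream(raw, password)) {
            LocalFileHeader header;
            byte[] buffer = new byte[8192];
            while ((header = zin.getNextEntry()) != null) {
                if (header.isDirectory()) {
                    continue; // directories carry no data
                }
                ByteArrayOutputStream out = new ByteArrayOutputStream();
                int n;
                while ((n = zin.read(buffer)) != -1) {
                    out.write(buffer, 0, n);
                }
                entries.put(header.getFileName(), out.toByteArray());
            }
        }
        return entries;
    }

    // Opens the archive directly from HDFS and decrypts it on the fly.
    // The FileSystem instance is cached by Hadoop, so it is not closed here.
    static Map<String, byte[]> readFromHdfs(String hdfsPath, char[] password) throws IOException {
        FileSystem fs = FileSystem.get(new Configuration());
        try (InputStream in = fs.open(new Path(hdfsPath))) {
            return readEntries(in, password);
        }
    }
}
```

A call such as `HdfsEncryptedZipReader.readFromHdfs("/data/archive.zip", "secret".toCharArray())` (path and password are placeholders) would return the decrypted entries keyed by file name. A wrong password surfaces as an `IOException` thrown by zip4j while reading.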