I used LZO to compress reduce output. I tried this: Hadoop-LZO project of Kevin Weil and then used LzoCodec
class with my job:
TextOutputFormat.setOutputCompressorClass(job, LzoCodec.class);
Now compression works just fine.
My problem is that compression result is a .lzo_deflate
file which I just can't decompress.
Lzop utility doesn't seem to support that type of file.
LzopCodec
is supposed to give a .lzo
file, but it did not work, however it's in th same package as LzoCodec
(org.apache.hadoop.io.compress
) which may refer to a compatibility issue, since I used the old API (0.19) to make compression works.
Answers to this question suggest Python solutions, however I need it in Java.
I'm using Hadoop 1.1.2 and Java 6.