Questions tagged [bzip2]

bzip2 is a Unix command used for compression and decompression of files. The main advantage of bzip2 is that it has a high compression ratio with reasonable speed.

bzip2 is one of the most widely used free compression programs for the terminal.

It typically compresses files to within 10% to 15% of the best available techniques (the PPM family of statistical compressors), while being around twice as fast as those techniques at compression and six times faster at decompression.

The current version is 1.0.8, released 13 July 2019.
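
As a rough illustration of the compress/decompress round trip described above, here is a minimal sketch that uses Python's standard `bz2` module rather than the `bzip2` command itself. The sample data is made up for the example, and the ratio it yields says nothing about real-world files.

```python
import bz2

# Illustrative input only: highly repetitive text, which compresses very well.
data = b"bzip2 compresses repetitive data well. " * 1000

# compresslevel ranges from 1 to 9; 9 is the default and uses the largest block size.
compressed = bz2.compress(data, compresslevel=9)

# Decompression restores the original bytes exactly.
restored = bz2.decompress(compressed)

print(f"original: {len(data)} bytes, compressed: {len(compressed)} bytes")
```

The same module also provides `bz2.open()` for reading and writing `.bz2` files directly, which is usually more convenient than holding whole files in memory.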

327 questions
297
votes
8 answers

Utilizing multi core for tar+gzip/bzip compression/decompression

I normally compress using tar zcvf and decompress using tar zxvf (using gzip due to habit). I've recently gotten a quad core CPU with hyperthreading, so I have 8 logical cores, and I notice that many of the cores are unused during…
user1118764
  • 9,255
  • 18
  • 61
  • 113
118
votes
16 answers

missing python bz2 module

I have installed Python in my home directory. [spatel@~ dev1]$ /home/spatel/python-2.7.3/bin/python -V Python 2.7.3 I am trying to run a script that requires Python 2.7.x, and I am getting a missing bz2 error [spatel@~ dev1]$…
Satish
  • 16,544
  • 29
  • 93
  • 149
38
votes
5 answers

Extract bz2 file in R

I have a bunch of .csv.bz2 files, which I have to download, extract, and read in R. I downloaded the file and want to extract it to the current working directory, then read it. I tried unz(filename,filename.csv), but it does not seem to work. How can I do that? I…
Prabhu
  • 5,296
  • 4
  • 37
  • 45
31
votes
4 answers

Pyenv's python is missing bzip2 module

I used pyenv to install python 3.8.2 and to create a virtualenv. In the virtualenv, I used pipenv to install pandas. However, when importing pandas, I'm getting the following: [...] File…
luislhl
  • 1,136
  • 1
  • 10
  • 23
29
votes
4 answers

Best splittable compression for Hadoop input = bz2?

We've realized a bit too late that archiving our files in GZip format for Hadoop processing isn't such a great idea. GZip isn't splittable, and for reference, here are the problems which I won't repeat: Very basic question about Hadoop and…
Suman
  • 9,221
  • 5
  • 49
  • 62
28
votes
3 answers

How to use awk for a compressed file

How can I change the following command for a compressed file? awk 'FNR==NR { array[$1,$2]=$8; next } ($1,$2) in array { print $0 ";" array[$1,$2] }' input1.vcf input2.vcf The command works fine with a normal file. I need to change the command for…
AKR
  • 359
  • 1
  • 5
  • 11
26
votes
7 answers

No module named '_bz2' in python3

When trying to execute the following command: import matplotlib.pyplot as plt The following error occurs: from _bz2 import BZ2Compressor, BZ2Decompressor ImportError: No module named '_bz2' So, I was trying to install bzip2 module in Ubuntu…
25
votes
2 answers

Uncompress BZIP2 archive

I can uncompress zip, gzip, and rar files, but I also need to uncompress bzip2 files as well as unarchive them (.tar). I haven't come across a good library to use. I am using Java along with Maven so ideally, I'd like to include it as a dependency…
Walter White
24
votes
5 answers

How to protect myself from a gzip or bzip2 bomb?

This is related to the question about zip bombs, but having gzip or bzip2 compression in mind, e.g. a web service accepting .tar.gz files. Python provides a handy tarfile module that is convenient to use, but does not seem to provide protection…
Joachim Breitner
  • 25,395
  • 6
  • 78
  • 139
22
votes
2 answers

Multistream Wikipedia dump

I downloaded the German Wikipedia dump dewiki-20151102-pages-articles-multistream.xml. My short question is: What does the 'multistream' mean in this case?
m4ri0
  • 597
  • 1
  • 6
  • 10
21
votes
1 answer

How to build boost iostreams with gzip and bzip2 support on Windows

How do I build boost's iostreams library with gzip and bzip2 support?
Cookie
  • 12,004
  • 13
  • 54
  • 83
17
votes
4 answers

Linux search text string from .bz2 files recursively in subdirectories

I have a case where multiple .bz2 files are located in subdirectories, and I want to search all of them for a text string using the bzcat and grep Linux commands. I am able to search one file at a time by using the following command: bzcat…
Semu
  • 223
  • 1
  • 2
  • 8
14
votes
1 answer

Boost iostreams with bzip - unresolved symbols

My project was using an older version of Boost's iostreams w/ bzip2. I'm now trying to upgrade to Boost 1.51. At first I did not compile with bzip so obviously I got the linker yelling about libboost_bzip2-vc100-mt-sgd-1_51.lib being missing. I then…
E.K.
  • 321
  • 1
  • 11
13
votes
3 answers

'bz2 is module not available' when installing Pandas with pip in python virtual environment

I am going through this post Numpy, Scipy, and Pandas - Oh My!, installing some python packages, but got stuck at the line for installing Pandas: pip install -e git+https://github.com/pydata/pandas#egg=pandas I changed 'wesm' to 'pydata' for the…
qAp
  • 1,139
  • 2
  • 12
  • 26
11
votes
3 answers

Calculate/validate bz2 (bzip2) CRC32 in Python

I'm trying to calculate/validate the CRC32 checksums for compressed bzip2 archives. .magic:16 = 'BZ' signature/magic number .version:8 = 'h' for Bzip2 ('H'uffman coding) .hundred_k_blocksize:8 =…
soulseekah
  • 8,770
  • 3
  • 53
  • 58