Skip to main content

Pbzip2

PBZIP2 is a parallel implementation of the bzip2 block-sorting file compressor that uses pthreads and achieves near-linear speedup on SMP machines. The output of this version is fully compatible with bzip2 v1.0.2 or newer (ie: anything compressed with pbzip2 can be decompressed with bzip2). PBZIP2 should work on any system that has a pthreads compatible C++ compiler (such as gcc).

Documentation

Supported Platforms and Architectures

  • linux-x64
  • linux-arm64

Package Dependencies

The following package dependencies are required to be installed on the Unix/Linux system in order to support the requirements of the Pbzip2 workload. Note that the Virtual Client will handle the installation of any required dependencies.

  • unzip
  • pbzip2

Workload Usage

usage: pbzip2 [ -123456789 ] [ -b#cdfhklm#p#qrS#tvVz ] [ filenames ... ]

Options:
-b#
Where # is block size in 100k steps (default 9 = 900k)
-c, --stdout
Output to standard out (stdout)
-d,--decompress
Decompress file
-f,--force
Force, overwrite existing output file
-h,--help
Print this help message
-k,--keep
Keep input file, do not delete
-l,--loadavg
Load average determines max number processors to use
-m#
Where # is max memory usage in 1MB steps (default 100 = 100MB)
-p#
Where # is the number of processors (default: autodetect)
-q,--quiet
Quiet mode (default)
-r,--read
Read entire input file into RAM and split between processors
-S#
Child thread stack size in 1KB steps (default stack size if unspecified)
-t,--test
Test compressed file integrity
-v,--verbose
Verbose mode
-V
Display version info for pbzip2 then exit
-z,--compress
Compress file (default)
-1,--fast ... -9,--best
Set BWT block size to 100k .. 900k (default 900k).
--ignore-trailing-garbage=#
Ignore trailing garbage flag (1 - ignored; 0 - forbidden)
If no file names are given, pbzip2 compresses or decompresses from standard input to standard output.

Example usage: pbzip2 -b15k myfile.tar
pbzip2 -p4 -r -5 myfile.tar second*.txt

What is Being Tested?

Pbzip2 is used to measure performance in terms of compressionTime, and ratio of compressed size and original size in case of compression and ratio of decompressed size and original size in case of compression. Below are the metrics measured by Pbzip2 Workload.

NameUnit
CompressionTimeseconds
Compressed size and Original size ratio-
Decompressed size and Original size ratio-

References