We present parallel algorithms and implementations of a bzip2-like lossless data compression scheme for GPU architectures. Our approach parallelizes three main stages in the bzip2 compression pipeline: Burrows-Wheeler transform (BWT), move-to-front transform (MTF), and Huffman coding. In particular, we utilize a two-level hierarchical sort for BWT, design a novel scan-based parallel MTF algorithm, and implement a parallel reduction scheme to build the Huffman tree. For each algorithm, we perform detailed performance analysis, discuss its strengths and weaknesses, and suggest future directions for improvements. Overall, our GPU implementation is dominated by BWT performance and is 2.78x slower than bzip2, with BWT and MTF-Huffman respectivel...
In this paper, we present an all-core implementation of Burrows Wheeler Compression algorithm that e...
Abstract. Nowadays, the use of digital communication systems has increased in such a way that networ...
Data movement has long been identified as the biggest challenge facing modern computer systems' desi...
In this thesis we present parallel algorithms and implementations of a bzip2-like lossless data comp...
Today’s high-performance computing (HPC) applications are producing vast volumes of data, which are ...
This thesis report is submitted in partial fulfilment of the requirements for the degree of Bachelor...
Lossless data compression is a promising software approach for reducing the bandwidth requirements o...
A parallel implementation of the bzip2 block-sorting lossless compression program is described. The ...
Modern Graphics Processing Units (GPUs) provide much higher off-chip memory bandwidth than CPUs, but...
Approximation schemes for optimal compression with static and sliding dictionaries which can run on ...
This thesis deals with Burrows-Wheeler transform (BWT) and possibilities of acceleration of this tra...
Approximation schemes for optimal compression with static and sliding dictionaries which can run on ...
Error-bounded lossy compression is a state-of-the-art data reduction technique for HPC applications ...
The paper presents two algorithms of parallelization of computations for lossless data compression. ...
Efficient image data compression algorithms are required to minimize the cost of data transmission a...
In this paper, we present an all-core implementation of Burrows Wheeler Compression algorithm that e...
Abstract. Nowadays, the use of digital communication systems has increased in such a way that networ...
Data movement has long been identified as the biggest challenge facing modern computer systems' desi...
In this thesis we present parallel algorithms and implementations of a bzip2-like lossless data comp...
Today’s high-performance computing (HPC) applications are producing vast volumes of data, which are ...
This thesis report is submitted in partial fulfilment of the requirements for the degree of Bachelor...
Lossless data compression is a promising software approach for reducing the bandwidth requirements o...
A parallel implementation of the bzip2 block-sorting lossless compression program is described. The ...
Modern Graphics Processing Units (GPUs) provide much higher off-chip memory bandwidth than CPUs, but...
Approximation schemes for optimal compression with static and sliding dictionaries which can run on ...
This thesis deals with Burrows-Wheeler transform (BWT) and possibilities of acceleration of this tra...
Approximation schemes for optimal compression with static and sliding dictionaries which can run on ...
Error-bounded lossy compression is a state-of-the-art data reduction technique for HPC applications ...
The paper presents two algorithms of parallelization of computations for lossless data compression. ...
Efficient image data compression algorithms are required to minimize the cost of data transmission a...
In this paper, we present an all-core implementation of Burrows Wheeler Compression algorithm that e...
Abstract. Nowadays, the use of digital communication systems has increased in such a way that networ...
Data movement has long been identified as the biggest challenge facing modern computer systems' desi...