Modern Graphics Processing Units (GPUs) provide much higher off-chip memory bandwidth than CPUs, but many GPU applications are still limited by memory bandwidth. Unfortunately, off-chip memory bandwidth is growing slower than the number of cores and has become a performance bottleneck. Thus, optimizations of effective memory bandwidth play a significant role for scaling the performance of GPUs. Memory compression is a promising approach for improving memory bandwidth which can translate into higher performance and energy efficiency. However, compression is not free and its challenges need to be addressed, otherwise the benefits of compression may be offset by its overhead. We propose an entropy encoding based memory compression (E2MC) techn...
Compute-intensive tasks in high-end high performance computing (HPC) systems often generate large a...
Query co-processing on graphics processors (GPUs) has become an effective means to improve the perfo...
Error-bounded lossy compression is a state-of-the-art data reduction technique for HPC applications ...
Modern Graphics Processing Units (GPUs) provide much higher off-chip memory bandwidth than CPUs, but...
Today’s high-performance computing (HPC) applications are producing vast volumes of data, which are ...
Memory compression is a promising approach for reducing memory bandwidth requirements and increasing...
We present parallel algorithms and implementations of a bzip2-like lossless data compression scheme ...
In this thesis we present parallel algorithms and implementations of a bzip2-like lossless data comp...
Modern data-intensive computing forces system designers to deliver good performance under several ma...
Modern Graphics Processing Units (GPUs) are well provi-sioned to support the concurrent execution of...
Memory bandwidth compression can be an effective way to achieve higher system performance and energy...
GPU memory systems adopt a multi-dimensional hardware structure to provide the bandwidth necessary t...
JPEG XS is a new standard for low-latency and low-complexity coding designed by the JPEG committee. ...
Many important client and data-center applications need large memory capacity and high memory bandwi...
Abstract—Memory bandwidth compression can be an effective way to achieve higher system performance a...
Compute-intensive tasks in high-end high performance computing (HPC) systems often generate large a...
Query co-processing on graphics processors (GPUs) has become an effective means to improve the perfo...
Error-bounded lossy compression is a state-of-the-art data reduction technique for HPC applications ...
Modern Graphics Processing Units (GPUs) provide much higher off-chip memory bandwidth than CPUs, but...
Today’s high-performance computing (HPC) applications are producing vast volumes of data, which are ...
Memory compression is a promising approach for reducing memory bandwidth requirements and increasing...
We present parallel algorithms and implementations of a bzip2-like lossless data compression scheme ...
In this thesis we present parallel algorithms and implementations of a bzip2-like lossless data comp...
Modern data-intensive computing forces system designers to deliver good performance under several ma...
Modern Graphics Processing Units (GPUs) are well provi-sioned to support the concurrent execution of...
Memory bandwidth compression can be an effective way to achieve higher system performance and energy...
GPU memory systems adopt a multi-dimensional hardware structure to provide the bandwidth necessary t...
JPEG XS is a new standard for low-latency and low-complexity coding designed by the JPEG committee. ...
Many important client and data-center applications need large memory capacity and high memory bandwi...
Abstract—Memory bandwidth compression can be an effective way to achieve higher system performance a...
Compute-intensive tasks in high-end high performance computing (HPC) systems often generate large a...
Query co-processing on graphics processors (GPUs) has become an effective means to improve the perfo...
Error-bounded lossy compression is a state-of-the-art data reduction technique for HPC applications ...