This paper describes Approximate Value Reconstruction (AVR), an architecture for approximate memory compression. AVR reduces the memory traffic of applications that tolerate approximations in their dataset. Thereby, it utilizes more efficiently off-chip bandwidth improving significantly system performance and energy efficiency. AVR compresses memory blocks using low latency downsampling that exploits similarities between neighboring values and achieves aggressive compression ratios, up to 16:1 in our implementation. The proposed AVR architecture supports our compression scheme maximizing its effect and minimizing its overheads by (i) co-locating in the Last Level Cache (LLC) compressed and uncompressed data, (ii) efficiently handling LLC ev...
Preserving memory locality is a major issue in highly-multithreaded architectures such as GPUs. Thes...
pre-printMemory compression has been proposed and deployed in the past to grow the capacity of a mem...
Memory compression is a promising approach for reducing memory bandwidth requirements and increasing...
Memory bandwidth is a critical resource in modern systems and has an increasing demand. The large nu...
Main memory is a critical resource in modern computer systems and is in increasing demand. An increa...
A key challenge in modern computing systems is to access data fast enough to fully utilize the compu...
Many important client and data-center applications need large memory capacity and high memory bandwi...
University of Minnesota M.S.E.E. thesis. November 2015. Major: Electrical Engineering. Advisor: Joh...
In this paper we introduce L2C, a hybrid lossy/lossless compression scheme applicable both to the me...
Approximate computing recognizes that many applications can tolerate inexactness. These applications...
A challenge in the design of high performance computer systems is how to transfer data efficiently b...
The performance gap between computer processors and memory bandwidth is severely limiting the throug...
Enabled by technology scaling, processing parallelism has been continuously increased to meet the de...
We investigate the feasibility of using instruction compression at some level in a multi-level memor...
Abstract—Approximate computing explores opportunities that emerge when applications can tolerate err...
Preserving memory locality is a major issue in highly-multithreaded architectures such as GPUs. Thes...
pre-printMemory compression has been proposed and deployed in the past to grow the capacity of a mem...
Memory compression is a promising approach for reducing memory bandwidth requirements and increasing...
Memory bandwidth is a critical resource in modern systems and has an increasing demand. The large nu...
Main memory is a critical resource in modern computer systems and is in increasing demand. An increa...
A key challenge in modern computing systems is to access data fast enough to fully utilize the compu...
Many important client and data-center applications need large memory capacity and high memory bandwi...
University of Minnesota M.S.E.E. thesis. November 2015. Major: Electrical Engineering. Advisor: Joh...
In this paper we introduce L2C, a hybrid lossy/lossless compression scheme applicable both to the me...
Approximate computing recognizes that many applications can tolerate inexactness. These applications...
A challenge in the design of high performance computer systems is how to transfer data efficiently b...
The performance gap between computer processors and memory bandwidth is severely limiting the throug...
Enabled by technology scaling, processing parallelism has been continuously increased to meet the de...
We investigate the feasibility of using instruction compression at some level in a multi-level memor...
Abstract—Approximate computing explores opportunities that emerge when applications can tolerate err...
Preserving memory locality is a major issue in highly-multithreaded architectures such as GPUs. Thes...
pre-printMemory compression has been proposed and deployed in the past to grow the capacity of a mem...
Memory compression is a promising approach for reducing memory bandwidth requirements and increasing...