This paper proposes a data compression scheme for minimizing memory traffic in processor-based systems. Data compression and decompression are performed on-the-fly on the cache-to-memory path, that is, uncompressed cache lines are compressed before they are written back to main memory, and decompressed when cache refills take place. The distinguishing feature of the presented solution is its ability of providing high memory traffic reductions without requiring data profiling information. In other words, thanks to the self-learning mechanism it implements, the proposed scheme performs very closely to special-purpose compression approaches, whose main limitation is their inapplicability when off-line data profiling is not feasible. Memory tra...
<p>Technological improvements in integrated circuits have for a long time allowed the performance of...
Technological improvements in integrated circuits have for a longtime allowed the performance of com...
The performance gap between computer processors and memory bandwidth is severely limiting the throug...
Storing data in compressed form is becoming common practice in high-performance systems, where memor...
Abstract—Storing data in compressed form is becoming common practice in high-performance systems, wh...
This paper describes implementation details of a hardware compression and decompression unit (CDU) f...
A challenge in the design of high performance computer systems is how to transferdata efficiently be...
This synthesis lecture presents the current state-of-the-art in applying low-latency, lossless hardw...
The memory system stores information comprising primarily instructions and data and secondarily addr...
A challenge in the design of high performance computer systems is how to transfer data efficiently b...
This article describes a new software-based on-line memory compression algorithm for embedded system...
Compressed representations of programs can be used to improve the code density in embedded systems. ...
This paper describes how profile-driven data compression, a very effective approach to reduce memory...
Abstract — Chip Multiprocessors (CMPs) combine multiple cores on a single die, typically with privat...
This article describes a new software-based on-line memory com-pression algorithm for embedded syste...
<p>Technological improvements in integrated circuits have for a long time allowed the performance of...
Technological improvements in integrated circuits have for a longtime allowed the performance of com...
The performance gap between computer processors and memory bandwidth is severely limiting the throug...
Storing data in compressed form is becoming common practice in high-performance systems, where memor...
Abstract—Storing data in compressed form is becoming common practice in high-performance systems, wh...
This paper describes implementation details of a hardware compression and decompression unit (CDU) f...
A challenge in the design of high performance computer systems is how to transferdata efficiently be...
This synthesis lecture presents the current state-of-the-art in applying low-latency, lossless hardw...
The memory system stores information comprising primarily instructions and data and secondarily addr...
A challenge in the design of high performance computer systems is how to transfer data efficiently b...
This article describes a new software-based on-line memory compression algorithm for embedded system...
Compressed representations of programs can be used to improve the code density in embedded systems. ...
This paper describes how profile-driven data compression, a very effective approach to reduce memory...
Abstract — Chip Multiprocessors (CMPs) combine multiple cores on a single die, typically with privat...
This article describes a new software-based on-line memory com-pression algorithm for embedded syste...
<p>Technological improvements in integrated circuits have for a long time allowed the performance of...
Technological improvements in integrated circuits have for a longtime allowed the performance of com...
The performance gap between computer processors and memory bandwidth is severely limiting the throug...