Energy efficiency is becoming a major constraint in processor designs. Every component of the processor should be reconsidered to reduce wasted energy and area. Prefetching is an important technique for tolerating memory latency. Prefetcher designs have important impact on the energy efficiency of the memory hierarchy. Stride prefetchers require little storage, but cannot handle irregular access patterns. Delta correlation (DC) prefetchers can handle complicated access patterns, but waste storage because of storing multiple miss addresses for a stride pattern. Moreover, DC prefetchers waste the bandwidth and energy of the memory hierarchy because they cannot identify whether an address has been prefetched and generate a large number of redu...
Data prefetching is a technique that plays a crucial role in modern high-performance processors by h...
The end of Dennard scaling has brought energy savings to the forefront of processor design. When cou...
An energy-efficient architecture should jointly optimize energy consumption and throughput, as captu...
Prior work in hardware prefetching has focused mostly on either predicting regular streams with unif...
As data prefetching is used in embedded processors, it is crucial to reduce the wasted energy for im...
There has been intensive research on data prefetching focusing on performance improvement, however, ...
Prefetching has emerged as one of the most successful techniques to bridge the gap between modern pr...
In the last century great progress was achieved in developing processors with extremely high computa...
Processor performance has increased far faster than memories have been able to keep up with, forcing...
High performance processors employ hardware data prefetching to reduce the negative performance impa...
Recent technological advances are such that the gap between processor cycle times and memory cycle t...
Extensive research has been done in prefetching techniques that hide memory latency in microprocesso...
Abstract—Data prefetching of regular access patterns is an effective mechanism to hide the memory la...
Achieving low load-to-use latency with low energy and storage overheads is critical for performance....
A well known performance bottleneck in computer architecture is the so-called memory wall. This term...
Data prefetching is a technique that plays a crucial role in modern high-performance processors by h...
The end of Dennard scaling has brought energy savings to the forefront of processor design. When cou...
An energy-efficient architecture should jointly optimize energy consumption and throughput, as captu...
Prior work in hardware prefetching has focused mostly on either predicting regular streams with unif...
As data prefetching is used in embedded processors, it is crucial to reduce the wasted energy for im...
There has been intensive research on data prefetching focusing on performance improvement, however, ...
Prefetching has emerged as one of the most successful techniques to bridge the gap between modern pr...
In the last century great progress was achieved in developing processors with extremely high computa...
Processor performance has increased far faster than memories have been able to keep up with, forcing...
High performance processors employ hardware data prefetching to reduce the negative performance impa...
Recent technological advances are such that the gap between processor cycle times and memory cycle t...
Extensive research has been done in prefetching techniques that hide memory latency in microprocesso...
Abstract—Data prefetching of regular access patterns is an effective mechanism to hide the memory la...
Achieving low load-to-use latency with low energy and storage overheads is critical for performance....
A well known performance bottleneck in computer architecture is the so-called memory wall. This term...
Data prefetching is a technique that plays a crucial role in modern high-performance processors by h...
The end of Dennard scaling has brought energy savings to the forefront of processor design. When cou...
An energy-efficient architecture should jointly optimize energy consumption and throughput, as captu...