Data prefetching has been researched intensively with a focus on performance improvement; however, the energy aspect of prefetching remains relatively unexplored. Our experiments show that although software prefetching tends to be more energy efficient, hardware prefetching outperforms software prefetching on most applications in terms of performance. This paper proposes several techniques to make hardware-based data prefetching power-aware. Our proposed techniques include three compiler-based approaches that make the prefetch predictor more power efficient. The compiler identifies the pattern of memory accesses in order to selectively apply different prefetching schemes depending on predicted access patterns and to filter out unnecessary ...
Performance-enhancement techniques improve CPU speed, but at higher cost to other valuable system re...
Energy efficiency is becoming a major constraint in processor designs. Every component of the proces...
Ever increasing memory latencies and deeper pipelines push memory farther from the processor. Prefet...
Prefetching has emerged as one of the most successful techniques to bridge the gap between modern pr...
Extensive research has been done in prefetching techniques that hide memory latency in microprocesso...
As data prefetching is used in embedded processors, it is crucial to reduce the wasted energy for im...
Processor performance has increased far faster than memories have been able to keep up with, forcing...
Recent technological advances are such that the gap between processor cycle times and memory cycle t...
An energy-efficient architecture should jointly optimize energy consumption and throughput, as captu...
Despite large caches, main-memory access latencies still cause significant performance losses in man...
A major performance limiter in modern processors is the long latencies caused by data cache misses. ...
Prefetching, i.e., exploiting the overlap of processor computations with data accesses, is one of s...
The end of Dennard scaling has brought energy savings to the forefront of processor design. When cou...
As process-scaling trends make the memory system an even more crucial bottleneck, the importance of ...
Abstract. Given the increasing gap between processors and memory, prefetching data into cache become...