Because data prefetching is now used in embedded processors, reducing the energy it wastes is crucial to improving energy efficiency. In this paper, we propose an adaptive prefetch filtering (APF) mechanism that reduces the bandwidth and energy wasted by useless prefetches, as well as the cache pollution they cause. APF records the prefetch-victim address pair of each issued prefetch and tracks which address in each pair the processor accesses first; this information guides the filtering of newly generated useless prefetches. Filtered prefetches are also recorded to build a feedback mechanism that avoids filtering useful prefetches. Experimental results demonstrate that APF reduces useless prefetches by an average of 53.81% with a mere ...
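The filtering idea in the abstract above can be illustrated with a small software model. This is only a sketch under stated assumptions, not the paper's hardware design: the class name `PrefetchFilterSketch`, the per-address counters, the `threshold` and `capacity` parameters, and the table organization are all hypothetical choices made for illustration. It shows the three ingredients the abstract names: recording prefetch-victim pairs, learning from which address of a pair is accessed first, and using a record of filtered prefetches as feedback.

```python
from collections import OrderedDict


class PrefetchFilterSketch:
    """Toy software model of prefetch filtering (all names are hypothetical)."""

    def __init__(self, capacity=64, threshold=2):
        self.capacity = capacity        # assumed table size, not from the paper
        self.threshold = threshold      # assumed filtering threshold
        self.pairs = OrderedDict()      # prefetch address -> victim address
        self.victims = {}               # victim address -> prefetch address
        self.useless = {}               # prefetch address -> useless count
        self.filtered = OrderedDict()   # recently filtered prefetch addresses

    def _drop_pair(self, prefetch_addr):
        # Remove a pair from both lookup tables, if it is present.
        victim = self.pairs.pop(prefetch_addr, None)
        if victim is not None:
            self.victims.pop(victim, None)

    def should_issue(self, prefetch_addr):
        # Filter a prefetch whose address has repeatedly proven useless,
        # and remember the filtered address for later feedback.
        if self.useless.get(prefetch_addr, 0) >= self.threshold:
            self.filtered[prefetch_addr] = True
            if len(self.filtered) > self.capacity:
                self.filtered.popitem(last=False)
            return False
        return True

    def record_issue(self, prefetch_addr, victim_addr):
        # Record the pair: the prefetched line and the line it evicted.
        self._drop_pair(prefetch_addr)
        if victim_addr in self.victims:
            self._drop_pair(self.victims[victim_addr])
        self.pairs[prefetch_addr] = victim_addr
        self.victims[victim_addr] = prefetch_addr
        if len(self.pairs) > self.capacity:
            oldest_prefetch, oldest_victim = self.pairs.popitem(last=False)
            self.victims.pop(oldest_victim, None)

    def on_demand_access(self, addr):
        if addr in self.pairs:
            # The prefetched line was accessed first: the prefetch was useful.
            self.useless[addr] = max(0, self.useless.get(addr, 0) - 1)
            self._drop_pair(addr)
        elif addr in self.victims:
            # The victim was accessed first: the prefetch was useless.
            prefetch_addr = self.victims[addr]
            self.useless[prefetch_addr] = self.useless.get(prefetch_addr, 0) + 1
            self._drop_pair(prefetch_addr)
        if addr in self.filtered:
            # Feedback: a filtered prefetch would have been useful, so stop
            # filtering this address.
            del self.filtered[addr]
            self.useless[addr] = 0


apf = PrefetchFilterSketch(threshold=2)
decisions = []
for _ in range(2):
    decisions.append(apf.should_issue(0x100))
    apf.record_issue(0x100, 0x200)
    apf.on_demand_access(0x200)           # victim touched first: useless
decisions.append(apf.should_issue(0x100))  # now filtered
apf.on_demand_access(0x100)                # demand access to a filtered address
decisions.append(apf.should_issue(0x100))  # feedback re-enables the prefetch
print(decisions)
```

In this model, two useless prefetches to the same address cause the third to be filtered, and a later demand access to the filtered address clears the counter so the prefetch is allowed again. A real design would likely use compact hashed tables rather than full addresses; that choice is left open here.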
The full text of this article is not available on SOAR. WSU users can access the article via IEEE Xp...
In this paper, we present our design of a high performance prefetcher, which exploits various locali...
In multi-core systems, an application's prefetcher can interfere with the memo...
There has been intensive research on data prefetching focusing on performance improvement; however, ...
Prefetching has emerged as one of the most successful techniques to bridge the gap between modern pr...
High performance processors employ hardware data prefetching to reduce the negative performance impa...
Extensive research has been done on prefetching techniques that hide memory latency in microprocesso...
Data prefetching is an effective way to bridge the increasing performance gap ...
An energy-efficient architecture should jointly optimize energy consumption and throughput, as captu...
Processor performance has increased far faster than memory performance has been able to keep up with, forcing...
Current multicore systems implement multiple hardware prefetchers to tolerate long main memory ...
A well-known performance bottleneck in computer architecture is the so-called memory wall. This term...
As process scaling trends make the memory system an even more crucial bottleneck, the importance of ...
Given the increasing gap between processors and memory, prefetching data into cache become...
Energy efficiency is becoming a major constraint in processor designs. Every component of the proces...