While data filter caches (DFCs) have been shown to be effective at reducing data access energy, they have not been adopted in processors due to the associated performance penalty caused by high DFC miss rates. In this article, we present a design that both decreases the DFC miss rate and completely eliminates the DFC performance penalty even for a level-one data cache (L1 DC) with a single cycle access time. First, we show that a DFC that lazily fills each word in a DFC line from an L1 DC only when the word is referenced is more energy-efficient than eagerly filling the entire DFC line. For a 512B DFC, we are able to eliminate loads of words into the DFC that are never referenced before being evicted, which occurred for about 75% of the wor...
L1 data caches in high-performance processors continue to grow in set associativity. Higher associat...
In most modern processor designs the L1 data cache has become a major consumer of power due to its i...
Level-one data cache (L1 DC) and data translation lookaside buffer (DTLB) accesses impact energy usa...
As CPU data requests to the level-one (L1) data cache (DC) can represent as much as 25% of an embedd...
Conventional Data Filter Cache (DFC) designs improve processor energy efficiency, but degrade perfor...
As CPU data requests to the level-one (L1) data cache (DC) can represent as much as 25 % of an embed...
Conventional data filter caches (DFCs) improve processor energy efficiency, but degrade performance....
Energy efficiency is one of the key metrics in the design of a wide range of processor types. For ex...
The number of battery powered devices is growing significantly and these devices require energy-effi...
The first level data cache in modern processors has become a major consumer of energy due to its inc...
In recent years, CPU performance has become energy constrained. If performance is to continue increa...
Minimizing power, increasing performance, and delivering effective memory bandwidth are today's prim...
High-performance processors use a large set–associative L1 data cache with multiple ports. As clock ...
As the performance gap between the processor cores and the memory subsystem increases, designers are...
Memory operations have a significant impact on both performance and energy usage even when an access...
L1 data caches in high-performance processors continue to grow in set associativity. Higher associat...
In most modern processor designs the L1 data cache has become a major consumer of power due to its i...
Level-one data cache (L1 DC) and data translation lookaside buffer (DTLB) accesses impact energy usa...
As CPU data requests to the level-one (L1) data cache (DC) can represent as much as 25% of an embedd...
Conventional Data Filter Cache (DFC) designs improve processor energy efficiency, but degrade perfor...
As CPU data requests to the level-one (L1) data cache (DC) can represent as much as 25 % of an embed...
Conventional data filter caches (DFCs) improve processor energy efficiency, but degrade performance....
Energy efficiency is one of the key metrics in the design of a wide range of processor types. For ex...
The number of battery powered devices is growing significantly and these devices require energy-effi...
The first level data cache in modern processors has become a major consumer of energy due to its inc...
In recent years, CPU performance has become energy constrained. If performance is to continue increa...
Minimizing power, increasing performance, and delivering effective memory bandwidth are today's prim...
High-performance processors use a large set–associative L1 data cache with multiple ports. As clock ...
As the performance gap between the processor cores and the memory subsystem increases, designers are...
Memory operations have a significant impact on both performance and energy usage even when an access...
L1 data caches in high-performance processors continue to grow in set associativity. Higher associat...
In most modern processor designs the L1 data cache has become a major consumer of power due to its i...
Level-one data cache (L1 DC) and data translation lookaside buffer (DTLB) accesses impact energy usa...