This paper exploits small-value locality to accelerate the execution of memory instructions. We find that narrow-width loads (NWLDs) — loads with small-value operands of 8 bits or less — comprise 26% of all executed loads across 40 applications of the SPEC benchmark suites. We establish that the frequency of NWLDs are almost independent of compiler and input data. We introduce narrow-width caches (NWC) to cache small-value memory words. NWCs provide a significant speedup for several memory-intensive applications with a negligible chip-area overhead. NWCs also reduce the overall energy dissipation and memory traffic
The line size/performance trade-offs in off-chip second-level caches in light of energy-efficiency a...
Several studies have shown that cache memories account for more than 40% of the total energy consume...
Low utilization of on-chip cache capacity limits perfor-mance and wastes energy because of the long ...
This paper exploits small-value locality to accelerate the execution of memory instructions. We find...
Embedded environment imposes severe constraints of system resources on embedded applications. Perfor...
On-chip cache memories are instrumental in tackling several performance and energy issues facing con...
The need in low power processor design is growing due to the reliability problem for high frequency,...
Abstract—Due to the high cell density, low leakage power consumption, and less vulnerability to soft...
textOne of the major limiters to computer systems and systems on chip (SOC) designs is accessing the...
Execution efficiency of memory instructions remains critically important. To this end, a plethora of...
Low utilization of on-chip cache capacity limits performance and wastes energy because of the long l...
Many important applications perform computations on non-standard bit-width values; mapping them to ...
To satisfy the demand for higher performance, modern processors are designed with a high degree of s...
Abstract — In many computer systems, a large portion of the execution time and energy consumption is...
This paper introduces the abstract concept of value-aware caches, which exploit value locality rathe...
The line size/performance trade-offs in off-chip second-level caches in light of energy-efficiency a...
Several studies have shown that cache memories account for more than 40% of the total energy consume...
Low utilization of on-chip cache capacity limits perfor-mance and wastes energy because of the long ...
This paper exploits small-value locality to accelerate the execution of memory instructions. We find...
Embedded environment imposes severe constraints of system resources on embedded applications. Perfor...
On-chip cache memories are instrumental in tackling several performance and energy issues facing con...
The need in low power processor design is growing due to the reliability problem for high frequency,...
Abstract—Due to the high cell density, low leakage power consumption, and less vulnerability to soft...
textOne of the major limiters to computer systems and systems on chip (SOC) designs is accessing the...
Execution efficiency of memory instructions remains critically important. To this end, a plethora of...
Low utilization of on-chip cache capacity limits performance and wastes energy because of the long l...
Many important applications perform computations on non-standard bit-width values; mapping them to ...
To satisfy the demand for higher performance, modern processors are designed with a high degree of s...
Abstract — In many computer systems, a large portion of the execution time and energy consumption is...
This paper introduces the abstract concept of value-aware caches, which exploit value locality rathe...
The line size/performance trade-offs in off-chip second-level caches in light of energy-efficiency a...
Several studies have shown that cache memories account for more than 40% of the total energy consume...
Low utilization of on-chip cache capacity limits perfor-mance and wastes energy because of the long ...