Abstract—High-end embedded processors demand complex on-chip cache hierarchies satisfying several contradicting design requirements such as high-performance operation and low energy consumption. This paper introduces light-power (LP) nonuniform cache architecture (NUCA), a tiled-cache addressing both goals. LP-NUCA places a group of small and low-latency tiles between the L1 and the last level cache (LLC) that adapt better to the application working sets and keep most recently evicted blocks close to L1. LP-NUCA is built around three specialized “net-works-in-cache, ” each aimed at a separate cache operation. To prove the design feasibility, we have fully implemented LP-NUCA in a 90-nm technology. From the VLSI implementation, we observe th...
Global interconnect becomes the delay bottleneck in microprocessor designs, and latency for large on...
This paper proposes a non-uniform cache architecture for reducing the power consumption of memory sy...
Non-uniform cache architectures (NUCAs) are a novel design paradigm for large last-level on-chip cac...
Abstract—High-end embedded processors demand complex on-chip cache hierarchies satisfying several co...
To deal with the “memory wall” problem, microprocessors include large secondary on-chip caches. But ...
Abstract—To deal with the “memory wall ” problem, micro-processors include large secondary on-chip c...
Future embedded applications will require high performance processors integrating fast and low-power...
Future embedded applications will require high performance processors integrating fast and low-power...
Journal ArticleModern processors dedicate more than half their chip area to large L2 and L3 caches ...
In the last years, embedded systems have evolved so that they offer capabilities we could only find ...
Portable devices often demand powerful processors to run computing intensive applications, such as v...
The last level on-chip cache (LLC) is becoming bigger and more complex to effectively support the va...
Abstract: Non-uniform cache architecture (NUCA) aims to limit the wire-delay problem typical of lar...
Cache working-set adaptation is key as embedded systems move to multiprocessor and Simultaneous Mult...
ABSTRACT NUCA caches are large L2 on-chip cache memories characterized by multi-bank partitioning a...
Global interconnect becomes the delay bottleneck in microprocessor designs, and latency for large on...
This paper proposes a non-uniform cache architecture for reducing the power consumption of memory sy...
Non-uniform cache architectures (NUCAs) are a novel design paradigm for large last-level on-chip cac...
Abstract—High-end embedded processors demand complex on-chip cache hierarchies satisfying several co...
To deal with the “memory wall” problem, microprocessors include large secondary on-chip caches. But ...
Abstract—To deal with the “memory wall ” problem, micro-processors include large secondary on-chip c...
Future embedded applications will require high performance processors integrating fast and low-power...
Future embedded applications will require high performance processors integrating fast and low-power...
Journal ArticleModern processors dedicate more than half their chip area to large L2 and L3 caches ...
In the last years, embedded systems have evolved so that they offer capabilities we could only find ...
Portable devices often demand powerful processors to run computing intensive applications, such as v...
The last level on-chip cache (LLC) is becoming bigger and more complex to effectively support the va...
Abstract: Non-uniform cache architecture (NUCA) aims to limit the wire-delay problem typical of lar...
Cache working-set adaptation is key as embedded systems move to multiprocessor and Simultaneous Mult...
ABSTRACT NUCA caches are large L2 on-chip cache memories characterized by multi-bank partitioning a...
Global interconnect becomes the delay bottleneck in microprocessor designs, and latency for large on...
This paper proposes a non-uniform cache architecture for reducing the power consumption of memory sy...
Non-uniform cache architectures (NUCAs) are a novel design paradigm for large last-level on-chip cac...