Journal ArticleAlthough microprocessor performance continues to increase at a rapid pace, the growing processor-memory speed gap threatens to limit future performance gains. In this paper, we propose a novel configurable cache and TLB as an alternative to conventional two-level hierarchies. This organization leverages repeater insertion to provide low-cost configurability of size and speed. A novel configuration management algorithm dynamically measures hit and miss intolerance over intervals of instruction execution in order to tailor the cache and TLB organizations on-the-fly to improve memory hierarchy performance. The result is an average 14% improvement in IPC and a speedup of up to 1.55 across a broad class of applications compared...
Journal ArticleThe speed gap between processors and memory system is becoming the performance bottle...
On multi-core processors, contention on shared resources such as the last level cache (LLC) and memo...
Minimizing power, increasing performance, and delivering effective memory bandwidth are today's prim...
Journal ArticleConventional microarchitectures choose a single memory hierarchy design point target...
Journal ArticleThe widespread use of repeaters in long wires creates the possibility of dynamically ...
In modern computers, memory hierarchies play a paramount role in improving the average execution tim...
To design computers which reach the performance limits of the implementation technology, one must un...
Current microprocessors improve performance by exploiting instruction-level parallelism (ILP). ILP h...
With the increasing gap between processor speed and memory speed, a sophisticated memory hierarchy i...
The memory hierarchy is predicted to consume up to 40% to 70% of total system power in future data c...
PhD ThesisCurrent microprocessors improve performance by exploiting instruction-level parallelism (I...
The increasing levels of transistor density have enabled integration of an increasing number of core...
textOne of the major limiters to computer system performance has been the access to main memory, wh...
To reduce latency and increase bandwidth to memory, modern microprocessors are often designed with d...
Memory (cache, DRAM, and disk) is in charge of providing data and instructions to a computer\u27s pr...
Journal ArticleThe speed gap between processors and memory system is becoming the performance bottle...
On multi-core processors, contention on shared resources such as the last level cache (LLC) and memo...
Minimizing power, increasing performance, and delivering effective memory bandwidth are today's prim...
Journal ArticleConventional microarchitectures choose a single memory hierarchy design point target...
Journal ArticleThe widespread use of repeaters in long wires creates the possibility of dynamically ...
In modern computers, memory hierarchies play a paramount role in improving the average execution tim...
To design computers which reach the performance limits of the implementation technology, one must un...
Current microprocessors improve performance by exploiting instruction-level parallelism (ILP). ILP h...
With the increasing gap between processor speed and memory speed, a sophisticated memory hierarchy i...
The memory hierarchy is predicted to consume up to 40% to 70% of total system power in future data c...
PhD ThesisCurrent microprocessors improve performance by exploiting instruction-level parallelism (I...
The increasing levels of transistor density have enabled integration of an increasing number of core...
textOne of the major limiters to computer system performance has been the access to main memory, wh...
To reduce latency and increase bandwidth to memory, modern microprocessors are often designed with d...
Memory (cache, DRAM, and disk) is in charge of providing data and instructions to a computer\u27s pr...
Journal ArticleThe speed gap between processors and memory system is becoming the performance bottle...
On multi-core processors, contention on shared resources such as the last level cache (LLC) and memo...
Minimizing power, increasing performance, and delivering effective memory bandwidth are today's prim...