High instruction fetch bandwidth is essential for high performance in today’s wide-issue outof-order processors. Instruction caches must provide a low miss rate as well as low latency. We introduce Procedure Level Relocation, a class of dynamic feedback-directed optimizations that substantially reduce the instruction cache miss rate by exploiting the temporal locality of procedure usage. Based on the observation that half of all procedures executed are at most 128 bytes in length, we present a Small Procedure Cache, a small and fast explicitly managed memory for storing small procedures. We show that Procedure Level Relocation into a Small Procedure Cache reduces the instruction cache miss rate by an average of 15%Technical report DCS-TR-53
In the embedded domain, the gap between memory and processor performance and the increase in applica...
The processor speeds continue to improve at a faster rate than the memory access times. The issue of...
Instruction-cache misses account for up to 40%; of execution time in online transaction processing (...
High instruction fetch bandwidth is essential for high performance in today’s wide-issue out-of-orde...
Truly incremental development is a holy grail of verification-intensive software industry. All facto...
(eng) Instruction cache performance is one of the bottle-necks of processor performance. In this pap...
In the past decade, processor speed has become signicantly faster than memory speed. Small, fast cac...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
Commercial link : http://www.springerlink.de/ ALCHEMY/http://www.springer.comCache memories were inv...
As the gap between memory and processor speeds continues to widen, cache efficiency is an increasing...
© 1994 ACM. In the past decade, processor speed has become significantly faster than memory speed. S...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
This work addresses the problem of the increasing performance disparity between the microprocessor a...
In the embedded domain, the gap between memory and processor performance and the increase in applica...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
In the embedded domain, the gap between memory and processor performance and the increase in applica...
The processor speeds continue to improve at a faster rate than the memory access times. The issue of...
Instruction-cache misses account for up to 40%; of execution time in online transaction processing (...
High instruction fetch bandwidth is essential for high performance in today’s wide-issue out-of-orde...
Truly incremental development is a holy grail of verification-intensive software industry. All facto...
(eng) Instruction cache performance is one of the bottle-necks of processor performance. In this pap...
In the past decade, processor speed has become signicantly faster than memory speed. Small, fast cac...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
Commercial link : http://www.springerlink.de/ ALCHEMY/http://www.springer.comCache memories were inv...
As the gap between memory and processor speeds continues to widen, cache efficiency is an increasing...
© 1994 ACM. In the past decade, processor speed has become significantly faster than memory speed. S...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
This work addresses the problem of the increasing performance disparity between the microprocessor a...
In the embedded domain, the gap between memory and processor performance and the increase in applica...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
In the embedded domain, the gap between memory and processor performance and the increase in applica...
The processor speeds continue to improve at a faster rate than the memory access times. The issue of...
Instruction-cache misses account for up to 40%; of execution time in online transaction processing (...