High instruction fetch bandwidth is essential for high performance in today’s wide-issue out-of-order processors. Instruction caches must provide a low miss rate as well as low latency. We introduce Procedure Level Relocation, a class of dynamic feedbackdirected optimizations that substantially reduce the instruction cache miss rate by exploiting the temporal locality of procedure usage. Based on the observation that half of all procedures executed are at most 128 bytes in length, we present a Small Procedure Cache, a small and fast explicitly managed memory for storing small procedures. We show that Procedure Level Relocation into a Small Procedure Cache reduces the instruction cache miss rate by an average of 15%.
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
The speed of processors increases much faster than the memory access time. This makes memory accesse...
In the embedded domain, the gap between memory and processor performance and the increase in applica...
High instruction fetch bandwidth is essential for high performance in today’s wide-issue outof-order...
Truly incremental development is a holy grail of verification-intensive software industry. All facto...
(eng) Instruction cache performance is one of the bottle-necks of processor performance. In this pap...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
In the past decade, processor speed has become signicantly faster than memory speed. Small, fast cac...
Commercial link : http://www.springerlink.de/ ALCHEMY/http://www.springer.comCache memories were inv...
As the gap between memory and processor speeds continues to widen, cache efficiency is an increasing...
© 1994 ACM. In the past decade, processor speed has become significantly faster than memory speed. S...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
This work addresses the problem of the increasing performance disparity between the microprocessor a...
The trend in computer architecture is that processor speeds are increasing rapidly compared to memor...
In the embedded domain, the gap between memory and processor performance and the increase in applica...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
The speed of processors increases much faster than the memory access time. This makes memory accesse...
In the embedded domain, the gap between memory and processor performance and the increase in applica...
High instruction fetch bandwidth is essential for high performance in today’s wide-issue outof-order...
Truly incremental development is a holy grail of verification-intensive software industry. All facto...
(eng) Instruction cache performance is one of the bottle-necks of processor performance. In this pap...
In the past decade, processor speed has become significantly faster than memory speed. Small, fast c...
In the past decade, processor speed has become signicantly faster than memory speed. Small, fast cac...
Commercial link : http://www.springerlink.de/ ALCHEMY/http://www.springer.comCache memories were inv...
As the gap between memory and processor speeds continues to widen, cache efficiency is an increasing...
© 1994 ACM. In the past decade, processor speed has become significantly faster than memory speed. S...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
This work addresses the problem of the increasing performance disparity between the microprocessor a...
The trend in computer architecture is that processor speeds are increasing rapidly compared to memor...
In the embedded domain, the gap between memory and processor performance and the increase in applica...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
The speed of processors increases much faster than the memory access time. This makes memory accesse...
In the embedded domain, the gap between memory and processor performance and the increase in applica...