In order to meet the demands of wider issue processors, fetch mechanisms will need to fetch multiple basic blocks per cycle. The trace cache supplies several basic blocks each cycle by storing logically contiguous instructions in physically contiguous storage. When a particular basic block is requested, the trace cache can potentially respond with the requested block along with several blocks that followed it when the block was last encountered. In this technical report, we examine some critical features of a trace cache mechanism designed for a 16-wide issue processor and evaluate their effects on performance. We examine features such as cache associativity, storage partitioning, branch predictor design, instruction cache design, and fill ...
In this paper we address the important problem of instruction fetch for future wide issue superscala...
The design of higher performance processors has been following two major trends: increasing the pipe...
Techniques such as out-of-order issue and speculative execution aggressively exploit instruction lev...
To maximize the performance of a wide-issue superscalar processor, the fetch mechanism must be capab...
As the issue width of superscalar processors is increased, instruction fetch bandwidth requirements ...
As the instruction issue width of superscalar proces-sors increases, instruction fetch bandwidth req...
The trace cache is a recently proposed solution to achieving high instruction fetch bandwidth by buf...
The trace cache is a recently proposed solution to achieving high instruction fetch bandwidth by buf...
Trace caches are used to help dynamic branch prediction make multiple predictions in a cycle by embe...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
The objective of this paper is to improve the use of the hardware resources of the trace cache mecha...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
The objective of this paper is to improve the use of the hardware resources of the trace cache mecha...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
In this paper we address the important problem of instruction fetch for future wide issue superscala...
The design of higher performance processors has been following two major trends: increasing the pipe...
Techniques such as out-of-order issue and speculative execution aggressively exploit instruction lev...
To maximize the performance of a wide-issue superscalar processor, the fetch mechanism must be capab...
As the issue width of superscalar processors is increased, instruction fetch bandwidth requirements ...
As the instruction issue width of superscalar proces-sors increases, instruction fetch bandwidth req...
The trace cache is a recently proposed solution to achieving high instruction fetch bandwidth by buf...
The trace cache is a recently proposed solution to achieving high instruction fetch bandwidth by buf...
Trace caches are used to help dynamic branch prediction make multiple predictions in a cycle by embe...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
The objective of this paper is to improve the use of the hardware resources of the trace cache mecha...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
The objective of this paper is to improve the use of the hardware resources of the trace cache mecha...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
In this paper we address the important problem of instruction fetch for future wide issue superscala...
The design of higher performance processors has been following two major trends: increasing the pipe...
Techniques such as out-of-order issue and speculative execution aggressively exploit instruction lev...