In the past, instruction fetch speeds have been improved by using cache schemes that capture the actual program flow. In this proposal, we present the architecture of a new instruction cache named code pattern cache (CPC); the cache is used with superscalar processors. CPC?s operation is based on the fundamental principles that: common programs tend to repeat their execution patterns; and efficient storage of a program flow can enhance the performance of an instruction fetch mechanism. CPC saves basic blocks (sets of instructions separated by control instructions) and their boundary addresses while the code is running. Basic blocks and their addresses are stored in two separate structures, called block pointer cache (BPC) and basic block ca...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
Recent supers calar processors issue four tnstructzons per cycle. These processors are also powered ...
As the issue width of superscalar processors is increased, instruction fetch bandwidth requirements ...
To maximize the performance of a wide-issue superscalar processor, the fetch mechanism must be capab...
Techniques such as out-of-order issue and speculative execution aggressively exploit instruction lev...
The design of higher performance processors has been following two major trends: increasing the pipe...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
The design of higher performance processors has been following two major trends: increasing the pipe...
As the instruction issue width of superscalar proces-sors increases, instruction fetch bandwidth req...
Fetch performance is a very important factor because it effectively limits the overall processor per...
Future processors combining out-of-order execution with aggressive speculation techniques will need ...
Application-specific system-on-chip platforms create the opportunity to customize the cache configur...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
In superscalar processors, capable of issuing and executing multiple instructions per cycle, fetch p...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
Recent supers calar processors issue four tnstructzons per cycle. These processors are also powered ...
As the issue width of superscalar processors is increased, instruction fetch bandwidth requirements ...
To maximize the performance of a wide-issue superscalar processor, the fetch mechanism must be capab...
Techniques such as out-of-order issue and speculative execution aggressively exploit instruction lev...
The design of higher performance processors has been following two major trends: increasing the pipe...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
The design of higher performance processors has been following two major trends: increasing the pipe...
As the instruction issue width of superscalar proces-sors increases, instruction fetch bandwidth req...
Fetch performance is a very important factor because it effectively limits the overall processor per...
Future processors combining out-of-order execution with aggressive speculation techniques will need ...
Application-specific system-on-chip platforms create the opportunity to customize the cache configur...
In the past, instruction fetch speeds have been improved by using cache schemes that capture the act...
In superscalar processors, capable of issuing and executing multiple instructions per cycle, fetch p...
Trace cache, an instruction fetch technique that reduces taken branch penalties by storing and fetch...
Recent supers calar processors issue four tnstructzons per cycle. These processors are also powered ...
As the issue width of superscalar processors is increased, instruction fetch bandwidth requirements ...