In the pursuit of instruction-level parallelism, significant demands are placed on a processor's instruction delivery mechanism. In order to provide the performance necessary to meet future processor execution targets, the instruction delivery mechanism must scale with the execution core. Attaining these targets is a challenging task due to I-cache misses, branch mispredictions, and taken branches in the instruction stream. Moreover, there are a number of hardware scaling issues such as wire latency, clock scaling, and energy dissipation that can impact processor design. To address these issues, this thesis presents a fetch architecture that decouples the branch predictor from the instruction fetch unit. A Fetch Target Queue (FTQ) is ins...
Instruction cache miss latency is becoming an increasingly important performance bottleneck, especia...
Despite the extensive deployment of multi-core architectures in the past few years, the design and o...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
In the pursuit of instruction-level parallelism, significant demands are placed on a processor's ins...
In the pursuit of instruction-level parallelism, significant demands are placed on a processor’s ins...
The design of higher performance processors has been following two major trends: increasing the pipe...
Fetch performance is a very important factor because it effectively limits the overall processor per...
The continually increasing speed of microprocessors stresses the need for ever faster instruction fe...
The design of higher performance processors has been following two major trends: increasing the pipe...
The design of higher performance processors has been following two major trends: increasing the pipe...
The stream fetch engine is a high-performance fetch architecture based on the concept of an instruct...
To exploit larger amounts of parallelism, processors are being built with ever wider issue widths. U...
To exploit larger amounts of parallelism, processors are being built with ever wider issue widths. U...
Fetch performance is a very important factor because it effectively limits the overall processor per...
Abstract—Computer architecture is beset by two opposing trends. Technology scaling and deep pipelini...
Instruction cache miss latency is becoming an increasingly important performance bottleneck, especia...
Despite the extensive deployment of multi-core architectures in the past few years, the design and o...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
In the pursuit of instruction-level parallelism, significant demands are placed on a processor's ins...
In the pursuit of instruction-level parallelism, significant demands are placed on a processor’s ins...
The design of higher performance processors has been following two major trends: increasing the pipe...
Fetch performance is a very important factor because it effectively limits the overall processor per...
The continually increasing speed of microprocessors stresses the need for ever faster instruction fe...
The design of higher performance processors has been following two major trends: increasing the pipe...
The design of higher performance processors has been following two major trends: increasing the pipe...
The stream fetch engine is a high-performance fetch architecture based on the concept of an instruct...
To exploit larger amounts of parallelism, processors are being built with ever wider issue widths. U...
To exploit larger amounts of parallelism, processors are being built with ever wider issue widths. U...
Fetch performance is a very important factor because it effectively limits the overall processor per...
Abstract—Computer architecture is beset by two opposing trends. Technology scaling and deep pipelini...
Instruction cache miss latency is becoming an increasingly important performance bottleneck, especia...
Despite the extensive deployment of multi-core architectures in the past few years, the design and o...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...