Superscalar and superpipelining techniques increase the overlap between the instructions in a pipelined processor, and thus these techniques have the potential to improve processor performance by decreasing the average number of cycles between the execution of adjacent instructions. Yet, to obtain this potential performance benefit, an instruction scheduler for this high-performance processor must find the independent instructions within the instruction stream of an application to execute in parallel. For non-numerical applications, there is an insufficient number of independent instructions within a basic block, and consequently the instruction scheduler must search across the basic block boundaries for the extra instruction-level parallel...
Instruction Level Parallelism (ILP) speedups of an order-of-magnitude or greater may be possible usi...
Current trends in processor design are pointing to deeper and wider pipelines and superscalar archit...
In a dynamic reordering superscalar processor, the front-end fetches instructions and places them in...
The foremost goal of superscalar processor design is to increase performance through the exploitatio...
The available instruction level parallelism (ILP) is extremely limited within basic blocks of non-nu...
Control and data flow speculation can improve processor performance through increased ILP. First it ...
A superscalar processor supporting speculative ex-ecution requires an instruction fetch mechanism th...
Pipelining the scheduling logic, which exposes and exploits the instruction level parallelism, degra...
Simultaneous Multi-Threading (SMT) processors improve system performance by allowing concurrent exec...
Current trends in processor design are pointing to deeper and wider pipelines and superscalar archit...
International audienceTo maximize performance, out-of-order execution processors sometimes issue ins...
We present a technique for ameliorating the detrimental impact of the true data dependencies that ul...
In a recent paper by Smith, Lam and Horowitz [1] the concept of 'boosting' was introduced, where ins...
A simultaneous multithreaded (SMT) processor is able to issue and execute instructions from several ...
The major specific contributions are: (1) We introduce a new compiler analysis to identify the memor...
Instruction Level Parallelism (ILP) speedups of an order-of-magnitude or greater may be possible usi...
Current trends in processor design are pointing to deeper and wider pipelines and superscalar archit...
In a dynamic reordering superscalar processor, the front-end fetches instructions and places them in...
The foremost goal of superscalar processor design is to increase performance through the exploitatio...
The available instruction level parallelism (ILP) is extremely limited within basic blocks of non-nu...
Control and data flow speculation can improve processor performance through increased ILP. First it ...
A superscalar processor supporting speculative ex-ecution requires an instruction fetch mechanism th...
Pipelining the scheduling logic, which exposes and exploits the instruction level parallelism, degra...
Simultaneous Multi-Threading (SMT) processors improve system performance by allowing concurrent exec...
Current trends in processor design are pointing to deeper and wider pipelines and superscalar archit...
International audienceTo maximize performance, out-of-order execution processors sometimes issue ins...
We present a technique for ameliorating the detrimental impact of the true data dependencies that ul...
In a recent paper by Smith, Lam and Horowitz [1] the concept of 'boosting' was introduced, where ins...
A simultaneous multithreaded (SMT) processor is able to issue and execute instructions from several ...
The major specific contributions are: (1) We introduce a new compiler analysis to identify the memor...
Instruction Level Parallelism (ILP) speedups of an order-of-magnitude or greater may be possible usi...
Current trends in processor design are pointing to deeper and wider pipelines and superscalar archit...
In a dynamic reordering superscalar processor, the front-end fetches instructions and places them in...