This study has been carried out in order to determine cost-effective configurations of functional units for multiple-issue out-of-order superscalar processors. The trace-driven simulations were performed on the six integer and the fourteen floating-point programs from the SPEC 92 suite. We first evaluate the number of instructions allowed to be concurrently processed by the execution stages of the pipeline. We then apply some restrictions on the execution issue of different instruction classes in order to define these configurations. We conclude that five to nine functional units are necessary to exploit Instruction-Level Parallelism. An important point is that several data cache ports are required in a processor of degree 4 or more. Finall...
We present a simple technique for instruction-level parallelism and analyze its performance impact. ...
We present a technique for ameliorating the detrimental impact of the true data dependencies that ul...
High-performance, general-purpose microprocessors serve as compute engines for computers ranging fro...
In out-of-order issue superscalar microprocessors, instructions must be buffered before they are iss...
The main aim of this short paper is to investigate multiple-instruction-issue in a high-performance ...
Multiscalar processors use a new, aggressive implementation paradigm for extracting large quantities...
Superscalar processing is the latest in a long series of innovations aimed at producing ever-faster ...
Chip multiprocessors (CMPs) aim to develop both instruction-level and thread-level parallelisms to b...
Superscalar in-order processors form an interesting alternative to out-of-order processors because o...
This paper describes a novel processor architecture, called hyperscalar processor architecture, whic...
Loops are the main time consuming part of programs based on floating point computations. The perform...
This dissertation demonstrates that through the careful application of hardware and software techniq...
A great deal of the current research into computer architecture is directed at Multiple Instruction ...
A mechanistic model for out-of-order superscalar processors is developed and then applied to the stu...
Superscalar in-order processors form an interesting alternative to out-of-order processors because o...
We present a simple technique for instruction-level parallelism and analyze its performance impact. ...
We present a technique for ameliorating the detrimental impact of the true data dependencies that ul...
High-performance, general-purpose microprocessors serve as compute engines for computers ranging fro...
In out-of-order issue superscalar microprocessors, instructions must be buffered before they are iss...
The main aim of this short paper is to investigate multiple-instruction-issue in a high-performance ...
Multiscalar processors use a new, aggressive implementation paradigm for extracting large quantities...
Superscalar processing is the latest in a long series of innovations aimed at producing ever-faster ...
Chip multiprocessors (CMPs) aim to develop both instruction-level and thread-level parallelisms to b...
Superscalar in-order processors form an interesting alternative to out-of-order processors because o...
This paper describes a novel processor architecture, called hyperscalar processor architecture, whic...
Loops are the main time consuming part of programs based on floating point computations. The perform...
This dissertation demonstrates that through the careful application of hardware and software techniq...
A great deal of the current research into computer architecture is directed at Multiple Instruction ...
A mechanistic model for out-of-order superscalar processors is developed and then applied to the stu...
Superscalar in-order processors form an interesting alternative to out-of-order processors because o...
We present a simple technique for instruction-level parallelism and analyze its performance impact. ...
We present a technique for ameliorating the detrimental impact of the true data dependencies that ul...
High-performance, general-purpose microprocessors serve as compute engines for computers ranging fro...