DS is a new microarchitecture that combines decoupled (DAE) and superscalar techniques to exploit instruction level parallelism. Programs for DS are compiled into two instruction substreams: the dominant substream navigates the control flow and the rest of computational task is shared between the dominant and subsidiary substreams. Each substream is processed by a separate superscalar core realizable with current VLSI technology. The implementation complexity usually found in a monolithic superscalar processor can be decentralized in each superscalar core. DS machines are binary compatible with superscalar machines having the same instruction set, and DS machines in a family are binary compatible. Three special instructions, BRQ, GETQ, and ...
A great deal of the current research into computer architecture is directed at Multiple Instruction ...
This thesis describes work done in two areas of compilation support for superscalar processors; regi...
High performance computer architectures increasingly use compile-time instruction scheduling to reor...
We present a simple technique for instruction-level parallelism and analyze its performance impact. ...
We present a technique for ameliorating the detrimental impact of the true data dependencies that ul...
A distributed instruction queue (DIQ) in a superscalar microprocessor supports multi-instruction iss...
Superscalar processing is the latest in a long series of innovations aimed at producing ever-faster ...
[[abstract]]This paper introduces a novel superscalar micro-architecture, called IAS-S, and its rela...
has emphasized instruction-level parallelism, which improves performance by increasing the number of...
The invention involves new microarchitecture apparatus and methods for superscalar microprocessors t...
To maximize the performance of wide-issue superscalar out-of-order microprocessors, the issue stage ...
The increasing density of VLSI circuits has motivated research into ways to utilize large area budge...
Multiscalar processors use a new, aggressive implementation paradigm for extracting large quantities...
The work presents a new principle for microprocessor design based on a pairwise balanced combinatori...
If a high-performance superscalar processor is to realise its full potential, the complier must re-o...
A great deal of the current research into computer architecture is directed at Multiple Instruction ...
This thesis describes work done in two areas of compilation support for superscalar processors; regi...
High performance computer architectures increasingly use compile-time instruction scheduling to reor...
We present a simple technique for instruction-level parallelism and analyze its performance impact. ...
We present a technique for ameliorating the detrimental impact of the true data dependencies that ul...
A distributed instruction queue (DIQ) in a superscalar microprocessor supports multi-instruction iss...
Superscalar processing is the latest in a long series of innovations aimed at producing ever-faster ...
[[abstract]]This paper introduces a novel superscalar micro-architecture, called IAS-S, and its rela...
has emphasized instruction-level parallelism, which improves performance by increasing the number of...
The invention involves new microarchitecture apparatus and methods for superscalar microprocessors t...
To maximize the performance of wide-issue superscalar out-of-order microprocessors, the issue stage ...
The increasing density of VLSI circuits has motivated research into ways to utilize large area budge...
Multiscalar processors use a new, aggressive implementation paradigm for extracting large quantities...
The work presents a new principle for microprocessor design based on a pairwise balanced combinatori...
If a high-performance superscalar processor is to realise its full potential, the complier must re-o...
A great deal of the current research into computer architecture is directed at Multiple Instruction ...
This thesis describes work done in two areas of compilation support for superscalar processors; regi...
High performance computer architectures increasingly use compile-time instruction scheduling to reor...