High speed scalar processing is an essential characteristic of high performance general purpose computer systems. Highly concurrent execution of scalar code is difficult due to data de-pendencies and conditional branches. This paper proposes an architectural concept called guarded instructions to reduce the penalty of conditional branches in deeply pipelined processors. A code generation heuristic, the decision tree scheduling tech-nlque, reorders instructions in a complex of basic blocks so as to make efficient use of guarded instructions. Performance evalu-ation of several benchmarks are presented, including a module from the UNIX kernel. Even with these difficult scalar code ex-amples, a speedup of two is achievable by using conventional...
Superscalar architectural techniques increase instruction throughput by increasing resources and usi...
An architecture for high-performance scalar computation is proposed and discussed. The main feature ...
The work presents a new principle for microprocessor design based on a pairwise balanced combinatori...
Extensive research as been done on extracting parallelism from single instruction stream processors....
Superscalar and superpipelining techniques increase the overlap between the instructions in a pipeli...
Pipelining is a pervasive hardware implementation strategy used to increase the execution speeds of ...
Super-scalar processors can execute multiple instructions out-of-order per cycle and speculatively ...
Dynamic configuration of application-specific implicit instructions has been proposed to better expl...
Pipelined microprocessors allow the simultaneous execution of several machine instructions at a time...
This dissertation demonstrates that through the careful application of hardware and software techniq...
High performance computer architectures increasingly use compile-time instruction scheduling to reor...
Our goal is to dramatically increase the performance of uniprocessors through the exploitation of in...
The increasing density of VLSI circuits has motivated research into ways to utilize large area budge...
[[abstract]]A pipelined processor increases its performance by partitioning an instruction into seve...
SIMP is a novel multiple instruction-pipeline parallel architecture. It is targeted for enhancing th...
Superscalar architectural techniques increase instruction throughput by increasing resources and usi...
An architecture for high-performance scalar computation is proposed and discussed. The main feature ...
The work presents a new principle for microprocessor design based on a pairwise balanced combinatori...
Extensive research as been done on extracting parallelism from single instruction stream processors....
Superscalar and superpipelining techniques increase the overlap between the instructions in a pipeli...
Pipelining is a pervasive hardware implementation strategy used to increase the execution speeds of ...
Super-scalar processors can execute multiple instructions out-of-order per cycle and speculatively ...
Dynamic configuration of application-specific implicit instructions has been proposed to better expl...
Pipelined microprocessors allow the simultaneous execution of several machine instructions at a time...
This dissertation demonstrates that through the careful application of hardware and software techniq...
High performance computer architectures increasingly use compile-time instruction scheduling to reor...
Our goal is to dramatically increase the performance of uniprocessors through the exploitation of in...
The increasing density of VLSI circuits has motivated research into ways to utilize large area budge...
[[abstract]]A pipelined processor increases its performance by partitioning an instruction into seve...
SIMP is a novel multiple instruction-pipeline parallel architecture. It is targeted for enhancing th...
Superscalar architectural techniques increase instruction throughput by increasing resources and usi...
An architecture for high-performance scalar computation is proposed and discussed. The main feature ...
The work presents a new principle for microprocessor design based on a pairwise balanced combinatori...