277 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1997.For many applications, speculative execution alone is not sufficient to achieve high performance. The fundamental limitation is that speculation only removes dependences between branches and other instructions. The branches themselves remain in the code, which causes difficult problems. This motivates the second technique investigated in this dissertation, predicated execution, which is an architectural capability that enables the conditional execution of instructions based on the value of a Boolean source operand. Predicated execution allows a compiler to eliminate branch instructions using this conditional execution support. Additionally, predicated execution provides ...
High performance architectures have always had to deal with the performance-limiting impact of branc...
Instruction Level Parallelism (ILP) speedups of an order-of-magnitude or greater may be possible usi...
Partial redundancy elimination (PRE) is one of the most widespread optimizations in compilers. Howev...
Predicated execution has been used to reduce the number of branch mispredictions by eliminating hard...
Conventional speculative architectures use branch prediction to evaluate the most likely execution p...
Speculative execution has long been used as an approach to exploit instruction level parallelism acr...
The performance of modern processors is increasingly de-pendent on their ability to execute multiple...
In a recent paper by Smith, Lam and Horowitz [1] the concept of 'boosting' was introduced, where ins...
Modern compilers must expose sufficient amounts of Instruction-Level Parallelism (ILP) to achieve th...
Predicated execution is a promising architectural feature for exploiting instruction-level paralleli...
The available instruction level parallelism (ILP) is extremely limited within basic blocks of non-nu...
Increasing system complexity of SOC applications leads to an increased need of powerful embedded DSP...
Architectural support for predicated execution has been proposed as a manner of attacking performanc...
textEven after decades of research in branch prediction, branch predictors still remain imperfect, w...
In this paper we evaluate the effects of guarded (or conditional, or predicated) execution on the pe...
High performance architectures have always had to deal with the performance-limiting impact of branc...
Instruction Level Parallelism (ILP) speedups of an order-of-magnitude or greater may be possible usi...
Partial redundancy elimination (PRE) is one of the most widespread optimizations in compilers. Howev...
Predicated execution has been used to reduce the number of branch mispredictions by eliminating hard...
Conventional speculative architectures use branch prediction to evaluate the most likely execution p...
Speculative execution has long been used as an approach to exploit instruction level parallelism acr...
The performance of modern processors is increasingly de-pendent on their ability to execute multiple...
In a recent paper by Smith, Lam and Horowitz [1] the concept of 'boosting' was introduced, where ins...
Modern compilers must expose sufficient amounts of Instruction-Level Parallelism (ILP) to achieve th...
Predicated execution is a promising architectural feature for exploiting instruction-level paralleli...
The available instruction level parallelism (ILP) is extremely limited within basic blocks of non-nu...
Increasing system complexity of SOC applications leads to an increased need of powerful embedded DSP...
Architectural support for predicated execution has been proposed as a manner of attacking performanc...
textEven after decades of research in branch prediction, branch predictors still remain imperfect, w...
In this paper we evaluate the effects of guarded (or conditional, or predicated) execution on the pe...
High performance architectures have always had to deal with the performance-limiting impact of branc...
Instruction Level Parallelism (ILP) speedups of an order-of-magnitude or greater may be possible usi...
Partial redundancy elimination (PRE) is one of the most widespread optimizations in compilers. Howev...