To continue to improve processor performance, microarchitects seek to increase the effective instruction level parallelism (ILP) that can be exploited in applications. A fundamental limit to improving ILP is data dependences among instructions. If data dependence information is available at run-time, there are many uses to improve ILP. Prior published examples include decoupled branch exectuion architectures and critical instruction detection. In this paper, we describe an efficient hardware mechanism to dynamically track the data dependence chains of the instructions in the pipeline. This information is available on a cycle-by-cycle basis to the microengine for optimizing its perfromance. We then use this design in a new value-based branc...
Abstract — Branch prediction has been playing an increas-ingly important role in improving the perfo...
A larger instruction window on Out-of-Order (OoO) cores facilitates better exploitation of inherent ...
Although some instructions hurt performance more than others, current processors typically apply sch...
To continue to improve processor performance, microar-chitects seek to increase the effective instru...
textPerformance of modern pipelined processor depends on steady flow of useful instructions for proc...
As the issue width and depth of pipelining of high performance superscalar processors increase, the ...
Modern superscalar processors rely on branch predictors to sustain a high instruction fetch throughp...
The importance of accurate branch prediction to future processors has been widely recognized. The co...
Instruction Level Parallelism (ILP) is one of the key issues to boost the performance of future gene...
Value prediction breaks data dependencies in a program thereby creating instruction level parallelis...
Branch prediction is critical in exploring instruction level parallelism for modern processors. Prev...
Value prediction attempts to eliminate true-data dependencies by dynamically predicting the outcome ...
This paper presents the concept of dynamic control independence (DCI) and shows how it can be detect...
Although some instructions hurt performance more than oth-ers, current processors typically apply sc...
Dependencies between instructions restrict the instruction-level parallelism, and make difficult for...
Abstract — Branch prediction has been playing an increas-ingly important role in improving the perfo...
A larger instruction window on Out-of-Order (OoO) cores facilitates better exploitation of inherent ...
Although some instructions hurt performance more than others, current processors typically apply sch...
To continue to improve processor performance, microar-chitects seek to increase the effective instru...
textPerformance of modern pipelined processor depends on steady flow of useful instructions for proc...
As the issue width and depth of pipelining of high performance superscalar processors increase, the ...
Modern superscalar processors rely on branch predictors to sustain a high instruction fetch throughp...
The importance of accurate branch prediction to future processors has been widely recognized. The co...
Instruction Level Parallelism (ILP) is one of the key issues to boost the performance of future gene...
Value prediction breaks data dependencies in a program thereby creating instruction level parallelis...
Branch prediction is critical in exploring instruction level parallelism for modern processors. Prev...
Value prediction attempts to eliminate true-data dependencies by dynamically predicting the outcome ...
This paper presents the concept of dynamic control independence (DCI) and shows how it can be detect...
Although some instructions hurt performance more than oth-ers, current processors typically apply sc...
Dependencies between instructions restrict the instruction-level parallelism, and make difficult for...
Abstract — Branch prediction has been playing an increas-ingly important role in improving the perfo...
A larger instruction window on Out-of-Order (OoO) cores facilitates better exploitation of inherent ...
Although some instructions hurt performance more than others, current processors typically apply sch...