This paper presents the concept of dynamic control independence (DCI) and shows how it can be detected and exploited in an out-of-order superscalar processor to reduce the performance penalties of branch mispredictions. We show how DCI can be leveraged during branch misprediction recovery to reduce the number of instructions squashed on a misprediction as well as how it can be used to avoid predicting unpredictable branches by fetching instructions out-of-order. A practical implementation is described and evaluated using six SPECint95 benchmarks. We show that exploiting DCI during branch misprediction recovery improves performance by 0.9-9.9% on a 4-wide processor, by 1.8-11.2% on an 8-wide processor and by 1.915. 3% on a 12-wide processor....
The negative performance impact of branch mis-predictions can be reduced by exploiting control indep...
With the help of the memory dependence predictor the instruction scheduler can speculatively issue l...
High performance microprocessors have relied on accurate branch predictors to maintain high instruct...
Current processors exploit out-of-order execution and branch prediction to improve instruction level...
Though current general-purpose processors have several small CPU cores as opposed to a single more c...
Current trends in modern out-of-order processors involve imple-menting deeper pipelines and a large ...
Current trends in modern out-of-order processors involve implementing deeper pipelines and a large i...
An instruction is control independent of a preceding conditional branch if the decision to execute t...
Many algorithms are inherently sequential and hard to explicitly parallelize. Cores designed to aggr...
To continue to improve processor performance, microarchitects seek to increase the effective instruc...
To continue to improve processor performance, microar-chitects seek to increase the effective instru...
Abstract: In our previously published research we discovered some very difficult to predict branches...
As the issue width and depth of pipelining of high performance superscalar processors increase, the ...
An instruction is control independent of a preceding conditional branch if the decision to execute t...
In a dynamic reordering superscalar processor, the front-end fetches instructions and places them in...
The negative performance impact of branch mis-predictions can be reduced by exploiting control indep...
With the help of the memory dependence predictor the instruction scheduler can speculatively issue l...
High performance microprocessors have relied on accurate branch predictors to maintain high instruct...
Current processors exploit out-of-order execution and branch prediction to improve instruction level...
Though current general-purpose processors have several small CPU cores as opposed to a single more c...
Current trends in modern out-of-order processors involve imple-menting deeper pipelines and a large ...
Current trends in modern out-of-order processors involve implementing deeper pipelines and a large i...
An instruction is control independent of a preceding conditional branch if the decision to execute t...
Many algorithms are inherently sequential and hard to explicitly parallelize. Cores designed to aggr...
To continue to improve processor performance, microarchitects seek to increase the effective instruc...
To continue to improve processor performance, microar-chitects seek to increase the effective instru...
Abstract: In our previously published research we discovered some very difficult to predict branches...
As the issue width and depth of pipelining of high performance superscalar processors increase, the ...
An instruction is control independent of a preceding conditional branch if the decision to execute t...
In a dynamic reordering superscalar processor, the front-end fetches instructions and places them in...
The negative performance impact of branch mis-predictions can be reduced by exploiting control indep...
With the help of the memory dependence predictor the instruction scheduler can speculatively issue l...
High performance microprocessors have relied on accurate branch predictors to maintain high instruct...