As power-efficiency becomes paramount concern in processor design, architectures are coming up that completely do away with hardware branch prediction, and rely solely on software branch hinting. A popular example is the Synergistic Pro-cessing Unit (SPU) in the IBM Cell processor. To be able to minimize the branch penalty using branch hint instructions, in addition to estimating the branch probabilities (which has been looked at before [6, 25, 24]), it is important to care-fully insert branch hints. Towards this, in this paper, we i) construct a branch penalty model for compiler, ii) formulate the problem of minimizing branch penalty using branch hint-ing and iii) propose a heuristic to solve this problem. The heuristic is based on three b...
Branch prediction accuracy is a very important factor for superscalar processor performance. The abi...
Processor architectures will increasingly rely on issuing multiple instructions to make full use of ...
Modern superscalar processors rely on branch predictors to sustain a high instruction fetch throughp...
Energy-efficient dynamic branch predictors are proposed for the Cell SPE, which normally depends on ...
There is wide agreement that one of the most important impediments to the performance of current and...
There is wide agreement that one of the most important impediments to the performance of current and...
Dynamic branch predictor logic alone accounts for approximately 10% of total processor power dissipa...
High performance architectures have always had to deal with the performance-limiting impact of branc...
textPerformance of modern pipelined processor depends on steady flow of useful instructions for proc...
Pipeline stalls due to branches represent one of the most significant impediments to realizing the p...
To attain peak efficiency, high performance processors must anticipate changes in the flow of contro...
Although high branch prediction accuracy is necessary for high performance, it typically comes at th...
Energy efficiency is of the utmost importance in modern high-performance embedded processor design. ...
Accurate branch prediction can improve processor performance, while reducing energy waste. Though so...
In superpipeline microarchitecture, the instruction execution cycle is divided into many stages. Thi...
Branch prediction accuracy is a very important factor for superscalar processor performance. The abi...
Processor architectures will increasingly rely on issuing multiple instructions to make full use of ...
Modern superscalar processors rely on branch predictors to sustain a high instruction fetch throughp...
Energy-efficient dynamic branch predictors are proposed for the Cell SPE, which normally depends on ...
There is wide agreement that one of the most important impediments to the performance of current and...
There is wide agreement that one of the most important impediments to the performance of current and...
Dynamic branch predictor logic alone accounts for approximately 10% of total processor power dissipa...
High performance architectures have always had to deal with the performance-limiting impact of branc...
textPerformance of modern pipelined processor depends on steady flow of useful instructions for proc...
Pipeline stalls due to branches represent one of the most significant impediments to realizing the p...
To attain peak efficiency, high performance processors must anticipate changes in the flow of contro...
Although high branch prediction accuracy is necessary for high performance, it typically comes at th...
Energy efficiency is of the utmost importance in modern high-performance embedded processor design. ...
Accurate branch prediction can improve processor performance, while reducing energy waste. Though so...
In superpipeline microarchitecture, the instruction execution cycle is divided into many stages. Thi...
Branch prediction accuracy is a very important factor for superscalar processor performance. The abi...
Processor architectures will increasingly rely on issuing multiple instructions to make full use of ...
Modern superscalar processors rely on branch predictors to sustain a high instruction fetch throughp...