Modern embedded processors access the Branch Target Buffer (BTB) every cycle to speculate branch target addresses. Such accesses, quite often, are unnecessary as there is no branch instruction among those fetched. In this work we introduce Branchless Cycle Prediction (BLCP) to exploit this design inefficiency. BLCP uses a simple power efficient structure to predict cycles where there is no branch instruction among those fetched, at least one cycle in advance. We avoid accessing BTB during such cycles. We show that, by using BLCP, it is possible to reduce BTB power dissipation by 32 % while paying a negligible performance cos
Modern processors use branch prediction as an optimization to improve processor performance. Predict...
CMOS technology scaling improves the speed and functionality of microprocessors by reducing the size...
Nowadays energy-efficiency becomes the first design metric in chip development. To pursue higher ene...
Ever increasing performance requirements have elevated deeply pipelined architectures to a standard ...
Although high branch prediction accuracy is necessary for high performance, it typically comes at th...
Energy efficiency is of the utmost importance in modern high-performance embedded processor design. ...
Accurate branch prediction is critical to performance; mispredicted branches mean that ten’s of cycl...
Processor architectures will increasingly rely on issuing multiple instructions to make full use of ...
This paper explores the role of branch predictor organization in power/energy/performance tradeoffs ...
Accurate branch prediction can improve processor performance, while reducing energy waste. Though so...
Main goal of the paper is introducing a dynamic branch prediction scheme suitable for energy-aware V...
The need to minimize power while maximizing performance has led to recent developments of powerful s...
The paper introduces a dynamic branch prediction scheme suitable for energy-aware Very Long Instruct...
Dynamic branch predictor logic alone accounts for approximately 10% of total processor power dissipa...
Modern processors use branch prediction as an optimization to improve processor performance. Predict...
Modern processors use branch prediction as an optimization to improve processor performance. Predict...
CMOS technology scaling improves the speed and functionality of microprocessors by reducing the size...
Nowadays energy-efficiency becomes the first design metric in chip development. To pursue higher ene...
Ever increasing performance requirements have elevated deeply pipelined architectures to a standard ...
Although high branch prediction accuracy is necessary for high performance, it typically comes at th...
Energy efficiency is of the utmost importance in modern high-performance embedded processor design. ...
Accurate branch prediction is critical to performance; mispredicted branches mean that ten’s of cycl...
Processor architectures will increasingly rely on issuing multiple instructions to make full use of ...
This paper explores the role of branch predictor organization in power/energy/performance tradeoffs ...
Accurate branch prediction can improve processor performance, while reducing energy waste. Though so...
Main goal of the paper is introducing a dynamic branch prediction scheme suitable for energy-aware V...
The need to minimize power while maximizing performance has led to recent developments of powerful s...
The paper introduces a dynamic branch prediction scheme suitable for energy-aware Very Long Instruct...
Dynamic branch predictor logic alone accounts for approximately 10% of total processor power dissipa...
Modern processors use branch prediction as an optimization to improve processor performance. Predict...
Modern processors use branch prediction as an optimization to improve processor performance. Predict...
CMOS technology scaling improves the speed and functionality of microprocessors by reducing the size...
Nowadays energy-efficiency becomes the first design metric in chip development. To pursue higher ene...