International audienceMost high performance general purpose processors leverage register renaming to implement optimizations such as move elimination or zero-idiom elimination. Those optimizations can be seen as forms of strength reduction whereby a faster but semantically equivalent operation is substituted to a slower operation. In this letter, we argue that other reductions can be performed dynamically if input values of instructions are known in time, i.e.,~prior to renaming. We study the potential for leveraging Value Prediction to achieve that goal and show that in SPEC2k17, an average of 3.3% (up to 6.8%) of the dynamic instructions could dynamically be strength reduced. Our experiments suggest that a state-of-the-art value predictor...
With the constant advances in technology that lead to the increasing of the transistor count and pro...
Modern processors rely heavily on speculation to provide performance. Techniques such as branch pred...
As processors continue to exploit more instruction level parallelism, greater demands are placed on ...
International audienceValue Prediction (VP) is a microarchitectural technique that speculatively bre...
International audience—Recently, Value Prediction (VP) has been gaining renewed traction in the rese...
This paper proposes and evaluates a new microarchitecture for out-of-order processors that supports ...
The storage for speculative values in superscalar processors is one of the main sources of complexit...
In this paper, we present a novel mechanism that implements register renaming, dynamic speculation a...
Value prediction breaks data dependencies in a pro-gram thereby creating instruction level paralleli...
The storage for speculative values in superscalar processors is one of the main sources of complexit...
Value prediction breaks data dependencies in a program thereby creating instruction level parallelis...
The fact that instructions in programs often produce repetitive results has motivated researchers to...
The effectiveness of static code optimizations--including static optimizations performed just-in-ti...
The effectiveness of static code optimizations--including static optimizations performed "just-...
this paper, we propose combining three prediction mechanisms into a hybrid predictor. Each predictor...
With the constant advances in technology that lead to the increasing of the transistor count and pro...
Modern processors rely heavily on speculation to provide performance. Techniques such as branch pred...
As processors continue to exploit more instruction level parallelism, greater demands are placed on ...
International audienceValue Prediction (VP) is a microarchitectural technique that speculatively bre...
International audience—Recently, Value Prediction (VP) has been gaining renewed traction in the rese...
This paper proposes and evaluates a new microarchitecture for out-of-order processors that supports ...
The storage for speculative values in superscalar processors is one of the main sources of complexit...
In this paper, we present a novel mechanism that implements register renaming, dynamic speculation a...
Value prediction breaks data dependencies in a pro-gram thereby creating instruction level paralleli...
The storage for speculative values in superscalar processors is one of the main sources of complexit...
Value prediction breaks data dependencies in a program thereby creating instruction level parallelis...
The fact that instructions in programs often produce repetitive results has motivated researchers to...
The effectiveness of static code optimizations--including static optimizations performed just-in-ti...
The effectiveness of static code optimizations--including static optimizations performed "just-...
this paper, we propose combining three prediction mechanisms into a hybrid predictor. Each predictor...
With the constant advances in technology that lead to the increasing of the transistor count and pro...
Modern processors rely heavily on speculation to provide performance. Techniques such as branch pred...
As processors continue to exploit more instruction level parallelism, greater demands are placed on ...