Abstract—Mobile and PC/server class processor companies continue to roll out flagship core microarchitectures that are faster than their predecessors. Meanwhile placing more cores on a chip coupled with constant supply voltage puts per-core energy consumption at a premium. Hence, the challenge is to find future microarchitecture optimizations that not only increase performance but also conserve energy. Eliminating branch mispredictions—which waste both time and energy—is valuable in this respect. In this paper, we explore the control-flow landscape by characterizing mispredictions in four benchmark suites. We find that a third of mispredictions-per-1K-instructions (MPKI) come from what we call separable branches: branches with large control...
This paper presents the concept of dynamic control independence (DCI) and shows how it can be detect...
Modern out-of-order processor architectures focus significantly on the high performance execution of...
Irregular control-flow structures like deeply nested conditional branches are common in real-world s...
Current processors exploit out-of-order execution and branch prediction to improve instruction level...
Conventional front-end designs attempt to maximize the number of "in-flight" instructions in the pip...
This paper proposes a new processor architecture for handling hard-to-predict branches, the diverge-...
With the constant advances in technology that lead to the increasing of the transistor count and pro...
Pipelined microprocessors allow the simultaneous execution of several machine instructions at a time...
Though current general-purpose processors have several small CPU cores as opposed to a single more c...
To alleviate the memory wall problem, current architec-tural trends suggest implementing large instr...
To alleviate the memory wall problem, current architectural trends suggest implementing large instru...
textEven after decades of research in branch prediction, branch predictors still remain imperfect, w...
Instruction pipelining, out-of-order execution, and branch prediction are techniques that improve pe...
Energy efficiency is of the utmost importance in modern high-performance embedded processor design. ...
The evolution of computer systems to continuously improve execution efficiency has traditionally emb...
This paper presents the concept of dynamic control independence (DCI) and shows how it can be detect...
Modern out-of-order processor architectures focus significantly on the high performance execution of...
Irregular control-flow structures like deeply nested conditional branches are common in real-world s...
Current processors exploit out-of-order execution and branch prediction to improve instruction level...
Conventional front-end designs attempt to maximize the number of "in-flight" instructions in the pip...
This paper proposes a new processor architecture for handling hard-to-predict branches, the diverge-...
With the constant advances in technology that lead to the increasing of the transistor count and pro...
Pipelined microprocessors allow the simultaneous execution of several machine instructions at a time...
Though current general-purpose processors have several small CPU cores as opposed to a single more c...
To alleviate the memory wall problem, current architec-tural trends suggest implementing large instr...
To alleviate the memory wall problem, current architectural trends suggest implementing large instru...
textEven after decades of research in branch prediction, branch predictors still remain imperfect, w...
Instruction pipelining, out-of-order execution, and branch prediction are techniques that improve pe...
Energy efficiency is of the utmost importance in modern high-performance embedded processor design. ...
The evolution of computer systems to continuously improve execution efficiency has traditionally emb...
This paper presents the concept of dynamic control independence (DCI) and shows how it can be detect...
Modern out-of-order processor architectures focus significantly on the high performance execution of...
Irregular control-flow structures like deeply nested conditional branches are common in real-world s...