Increasing system complexity of SOC applications leads to an increased need of powerful embedded DSP processors. To fulfill the required computational bandwidth, state-of-the-art DSP processors allow executing several instructions in parallel and for reaching higher clock frequencies they increase the number of pipeline stages. However, deeply pipelined processors have drawbacks in the execution of branch instructions: branch delays. In average not more than two branch delay slots can be used, additional ones keep unused and decrease the overall system performance. Instead of compensating the drawback of branch delays (e.g. branch prediction circuits) it is possible to reduce the number of branch delays by reducing the number of branch inst...
Pipeline stalls due to branches represent one of the most significant impediments to realizing the p...
Speculative execution of conditional branches has a high hardware cost, is limited by dynamic branc...
Abstract – This paper presents a new code optimiza-tion technique for a class of embedded processors...
Conventional speculative architectures use branch prediction to evaluate the most likely execution p...
277 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1997.For many applications, specul...
Predicated execution has been used to reduce the number of branch mispredictions by eliminating hard...
High performance architectures have always had to deal with the performance-limiting impact of branc...
In this paper we evaluate the effects of guarded (or conditional, or predicated) execution on the pe...
Pipelined microprocessors allow the simultaneous execution of several machine instructions at a time...
One of the main obstacles to exploiting the fine-grained parallelism that is available in general-pu...
With increasing demands on mobile communication transfer rates the circuits in mobile phones must be...
Predicated execution is an e#cient mechanism to avoid conditional constructs in application programs...
This paper proposes a new deterministic branch prediction unit to achieve a uniformly timed instruct...
Predicated Execution has been put forth as a method for improving processor performance by removing ...
Though current general-purpose processors have several small CPU cores as opposed to a single more c...
Pipeline stalls due to branches represent one of the most significant impediments to realizing the p...
Speculative execution of conditional branches has a high hardware cost, is limited by dynamic branc...
Abstract – This paper presents a new code optimiza-tion technique for a class of embedded processors...
Conventional speculative architectures use branch prediction to evaluate the most likely execution p...
277 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1997.For many applications, specul...
Predicated execution has been used to reduce the number of branch mispredictions by eliminating hard...
High performance architectures have always had to deal with the performance-limiting impact of branc...
In this paper we evaluate the effects of guarded (or conditional, or predicated) execution on the pe...
Pipelined microprocessors allow the simultaneous execution of several machine instructions at a time...
One of the main obstacles to exploiting the fine-grained parallelism that is available in general-pu...
With increasing demands on mobile communication transfer rates the circuits in mobile phones must be...
Predicated execution is an e#cient mechanism to avoid conditional constructs in application programs...
This paper proposes a new deterministic branch prediction unit to achieve a uniformly timed instruct...
Predicated Execution has been put forth as a method for improving processor performance by removing ...
Though current general-purpose processors have several small CPU cores as opposed to a single more c...
Pipeline stalls due to branches represent one of the most significant impediments to realizing the p...
Speculative execution of conditional branches has a high hardware cost, is limited by dynamic branc...
Abstract – This paper presents a new code optimiza-tion technique for a class of embedded processors...