Modern superscalar processors are able to potentially issue and execute multiple instructions per cycle. Several techniques over the years have focused on increasing the Instruction Level Parallelism (ILP) that a processor can exploit. However, there are many limitations of ILP that hinder performance, chief of them being the chain of dependencies between instructions that stops instructions from being executed in parallel. We propose a new micro-architecture design which extends the superscalar pipeline with a data-flow pipeline where the dataflow part identifies immediately dependent instructions and executes them early. The dataflow pipeline is able to identify redundant instructions, track changes in the operands of the redundant instru...
Superscalar microprocessors currently power the majority of computing machines. These processors ar...
Instruction Level Parallelism (ILP) speedups of an order-of-magnitude or greater may be possible usi...
Pipelining is a pervasive hardware implementation strategy used to increase the execution speeds of ...
Modern superscalar processors are able to potentially issue and execute multiple instructions per cy...
In this dissertation, we explore the concept of dynamic dependency collapsing. Performance increases...
Journal ArticleModern superscalar processors use wide instruction issue widths and out-of-order exe...
LaZy Superscalar is a processor architecture which delays the execution of fetched instructions unti...
The automatic and implicit transformation of sequential instruction streams, which execute efficient...
SIMP is a novel multiple instruction-pipeline parallel architecture. It is targeted for enhancing th...
Boosting instruction level parallelism in dynamically scheduled processors requires a large instruct...
This dissertation discusses a novel, previously unexplored execution model called Demand-Driven Exec...
Our goal is to dramatically increase the performance of uniprocessors through the exploitation of in...
To continue to improve processor performance, microarchitects seek to increase the effective instruc...
Instruction scheduling is one of the most important optimisations performed when producing code in a...
We present a technique for ameliorating the detrimental impact of the true data dependencies that ul...
Superscalar microprocessors currently power the majority of computing machines. These processors ar...
Instruction Level Parallelism (ILP) speedups of an order-of-magnitude or greater may be possible usi...
Pipelining is a pervasive hardware implementation strategy used to increase the execution speeds of ...
Modern superscalar processors are able to potentially issue and execute multiple instructions per cy...
In this dissertation, we explore the concept of dynamic dependency collapsing. Performance increases...
Journal ArticleModern superscalar processors use wide instruction issue widths and out-of-order exe...
LaZy Superscalar is a processor architecture which delays the execution of fetched instructions unti...
The automatic and implicit transformation of sequential instruction streams, which execute efficient...
SIMP is a novel multiple instruction-pipeline parallel architecture. It is targeted for enhancing th...
Boosting instruction level parallelism in dynamically scheduled processors requires a large instruct...
This dissertation discusses a novel, previously unexplored execution model called Demand-Driven Exec...
Our goal is to dramatically increase the performance of uniprocessors through the exploitation of in...
To continue to improve processor performance, microarchitects seek to increase the effective instruc...
Instruction scheduling is one of the most important optimisations performed when producing code in a...
We present a technique for ameliorating the detrimental impact of the true data dependencies that ul...
Superscalar microprocessors currently power the majority of computing machines. These processors ar...
Instruction Level Parallelism (ILP) speedups of an order-of-magnitude or greater may be possible usi...
Pipelining is a pervasive hardware implementation strategy used to increase the execution speeds of ...