In the modern era of wire-dominated architectures, specific effort must be made to reduce needless communication within out-of-order pipelines while still maintaining binary compatibility. To ease pressure on highly-connected elements such as the issue logic and bypass network, we propose the dynamic detection and speculative execution of instruction strands–linear chains of dependent instructions without intermediate fan-out. The hardware required for detecting these chains is simple and resides off the critical path of the pipeline, and the execution targets are the normal ALUs with a self-bypass mode. By collapsing these strings of dependencies into atomic macro-instructions, the efficiency of the issue queue and reorder buffer can be in...
Modern processors remove many artificial constraints on instruction ordering,permitting multiple ins...
The performance of pipelined processors is severely limited by data dependencies. In order to achiev...
An architecture that features dynamic multithreading execution of a single program is studied in thi...
In recent years, with only small fractions of modern processors now accessible in a single cycle, co...
Pipelined microprocessors allow the simultaneous execution of several machine instructions at a time...
In a dynamic reordering superscalar processor, the front-end fetches instructions and places them in...
In this paper, we propose a new issue queue design that is capable of scheduling reusable instructio...
In recent years, microprocessor manufacturers have shifted their focus from single-core to multi-cor...
Speculatively issued instructions may be particularly sensitive to increases in pipeline depth. Our ...
In simultaneous multithreaded architectures many separate threads are running concurrently, sharing ...
Pipelining the scheduling logic, which exposes and exploits the instruction level parallelism, degra...
At present there exist three main schools of thought for improving single-threaded program performan...
Bypass delays are expected to grow beyond 1ns as technology scales. These delays necessitate pipelin...
Instruction issue logic is a critical component in modern high-performance out-of-order processors. ...
textSilicon reliability has reemerged as a very important problem in digital system design. As volta...
Modern processors remove many artificial constraints on instruction ordering,permitting multiple ins...
The performance of pipelined processors is severely limited by data dependencies. In order to achiev...
An architecture that features dynamic multithreading execution of a single program is studied in thi...
In recent years, with only small fractions of modern processors now accessible in a single cycle, co...
Pipelined microprocessors allow the simultaneous execution of several machine instructions at a time...
In a dynamic reordering superscalar processor, the front-end fetches instructions and places them in...
In this paper, we propose a new issue queue design that is capable of scheduling reusable instructio...
In recent years, microprocessor manufacturers have shifted their focus from single-core to multi-cor...
Speculatively issued instructions may be particularly sensitive to increases in pipeline depth. Our ...
In simultaneous multithreaded architectures many separate threads are running concurrently, sharing ...
Pipelining the scheduling logic, which exposes and exploits the instruction level parallelism, degra...
At present there exist three main schools of thought for improving single-threaded program performan...
Bypass delays are expected to grow beyond 1ns as technology scales. These delays necessitate pipelin...
Instruction issue logic is a critical component in modern high-performance out-of-order processors. ...
textSilicon reliability has reemerged as a very important problem in digital system design. As volta...
Modern processors remove many artificial constraints on instruction ordering,permitting multiple ins...
The performance of pipelined processors is severely limited by data dependencies. In order to achiev...
An architecture that features dynamic multithreading execution of a single program is studied in thi...