RingScalar is a complexity-effective microarchitecture for out-of-order superscalar processors, that reduces the area, latency, and power of all major structures in the instruction flow. The design divides an N-way superscalar into N columns connected in a unidirectional ring, where each column contains a portion of the instruction window, a bank of the register file, and an ALU. The design exploits the fact that most decoded instructions are waiting on just one operand to use only a single tag per issue window entry, and to restrict instruction wakeup and value bypass to only communicate with the neighboring column. Detailed simulations of four-issue single-threaded machines running SPECint2000 show that RingScalar has IPC only 13% lowe...
International audienceThis paper proposes a new microarchitectural scheme for reducing the hardware ...
Applications vary in the degree of instruction level parallelism (ILP) available to be exploited by ...
There is still much performance to be gained by out-of-order processors with wider issue widths. Ho...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
High performance superscalar microarchitectures exploit instruction-level parallelism (ILP) to impro...
The advance of integration allows implementation of very wide issue superscalar processors on a sing...
The performance tradeoff between hardware complexity and clock speed is studied. First, a generic su...
Journal ArticleModern superscalar processors use wide instruction issue widths and out-of-order exe...
Current superscalar processors feature 64-bit datapaths to execute the program instructions, regardl...
The poor scalability of existing superscalar processors has been of great concern to the computer en...
International audienceDuring the past 10 years, the clock frequency of high-end superscalar processo...
has emphasized instruction-level parallelism, which improves performance by increasing the number of...
Journal ArticleDynamic superscalar processors execute multiple instructions out-of-order by looking ...
Complexity is one of the most important problems facing microarchitects. It is exacerbated by the ap...
Journal ArticleClustered microarchitectures are an attractive alternative to large monolithic super...
International audienceThis paper proposes a new microarchitectural scheme for reducing the hardware ...
Applications vary in the degree of instruction level parallelism (ILP) available to be exploited by ...
There is still much performance to be gained by out-of-order processors with wider issue widths. Ho...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
High performance superscalar microarchitectures exploit instruction-level parallelism (ILP) to impro...
The advance of integration allows implementation of very wide issue superscalar processors on a sing...
The performance tradeoff between hardware complexity and clock speed is studied. First, a generic su...
Journal ArticleModern superscalar processors use wide instruction issue widths and out-of-order exe...
Current superscalar processors feature 64-bit datapaths to execute the program instructions, regardl...
The poor scalability of existing superscalar processors has been of great concern to the computer en...
International audienceDuring the past 10 years, the clock frequency of high-end superscalar processo...
has emphasized instruction-level parallelism, which improves performance by increasing the number of...
Journal ArticleDynamic superscalar processors execute multiple instructions out-of-order by looking ...
Complexity is one of the most important problems facing microarchitects. It is exacerbated by the ap...
Journal ArticleClustered microarchitectures are an attractive alternative to large monolithic super...
International audienceThis paper proposes a new microarchitectural scheme for reducing the hardware ...
Applications vary in the degree of instruction level parallelism (ILP) available to be exploited by ...
There is still much performance to be gained by out-of-order processors with wider issue widths. Ho...