If a high-performance superscalar processor is to realise its full potential, the complier must re-order or schedule the object code at compile time. This scheduling creates groups of adjacent instructions that are independent and which therefore can be issued and executed in parallel at run time. This paper provides an overview of the Hatfield Superscalar Architecture (HSA), a multiple-instruction-issue architecture developed at the University of Hertfordshire to support the development of high-performance instruction schedulers. The long-term objective of the HSA is to develop the scheduling technology to realise an order of magnitude performance improvement over traditional RISC designs. The paper also presents results from the first HSA...
This thesis describes work done in two areas of compilation support for superscalar processors; regi...
This document presents a comprehensive overview of the Hatfield Superscalar Scheduler (HSS). It conc...
LaZy Superscalar is a processor architecture which delays the execution of fetched instructions unti...
It is increasingly accepted that superscalar processors can only achieve their full performance pote...
A great deal of the current research into computer architecture is directed at Multiple Instruction ...
High performance computer architectures increasingly use compile-time instruction scheduling to reor...
We present a simple technique for instruction-level parallelism and analyze its performance impact. ...
One of the main obstacles to exploiting the fine-grained parallelism that is available in general-pu...
Superscalar and VLIW processors can both execute multiple instructions each cycle. Each employs a di...
While delayed branch mechanisms were popular with the designers of RISC processors, most superscalar...
We present a technique for ameliorating the detrimental impact of the true data dependencies that ul...
[[abstract]]This paper introduces a novel superscalar micro-architecture, called IAS-S, and its rela...
The main aim of this short paper is to investigate multiple-instruction-issue in a high-performance ...
Extensive research as been done on extracting parallelism from single instruction stream processors....
Due to the character of the original source materials and the nature of batch digitization, quality ...
This thesis describes work done in two areas of compilation support for superscalar processors; regi...
This document presents a comprehensive overview of the Hatfield Superscalar Scheduler (HSS). It conc...
LaZy Superscalar is a processor architecture which delays the execution of fetched instructions unti...
It is increasingly accepted that superscalar processors can only achieve their full performance pote...
A great deal of the current research into computer architecture is directed at Multiple Instruction ...
High performance computer architectures increasingly use compile-time instruction scheduling to reor...
We present a simple technique for instruction-level parallelism and analyze its performance impact. ...
One of the main obstacles to exploiting the fine-grained parallelism that is available in general-pu...
Superscalar and VLIW processors can both execute multiple instructions each cycle. Each employs a di...
While delayed branch mechanisms were popular with the designers of RISC processors, most superscalar...
We present a technique for ameliorating the detrimental impact of the true data dependencies that ul...
[[abstract]]This paper introduces a novel superscalar micro-architecture, called IAS-S, and its rela...
The main aim of this short paper is to investigate multiple-instruction-issue in a high-performance ...
Extensive research as been done on extracting parallelism from single instruction stream processors....
Due to the character of the original source materials and the nature of batch digitization, quality ...
This thesis describes work done in two areas of compilation support for superscalar processors; regi...
This document presents a comprehensive overview of the Hatfield Superscalar Scheduler (HSS). It conc...
LaZy Superscalar is a processor architecture which delays the execution of fetched instructions unti...