Decoupled computer architectures provide high scalar performance by exploiting the fine--grained parallelism existing between the access and execute functions in a computer program. These architectures employ an access processor to perform data fetch ahead of demand by the execute process. Some of the decoupled architectures employ identical access and execute processors, but special processors to efficiently access data structures have also been proposed. In this paper, we present the design of an access processor targeted for VLSI implementation. The hardware elements, instruction set and instruction formats of the processor are presented in sufficient detail. The various trade-offs involved in the design are explained. Details of impleme...
124 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1982.When conventional von Neumann...
Providing adequate data bandwidth is extremely important for a wide-issue superscalar processor to a...
The purpose of this paper is to show that using decoupling techniques in a vector processor, the per...
We present a decoupled architecture of processors with a memory hierarchy of only scratch–pad memori...
An architecture for high-performance scalar computation is proposed and discussed. The main feature ...
This paper discusses an approach to reducing memory latency in future systems. It focuses on systems...
Decoupled computer architectures partition the memory access and execute functions in a computer pro...
Summarization: Mapping computational intensive applications on reconfigurable technology for acceler...
It is my great pleasure to serve as guest editor for this special issue of TCCA Newsletter, which is...
In this paper we investigate the behavior of data prefetching on an access decoupled machine and a s...
Several studies have demonstrated that out-of-order execution processors may not be the most adequat...
Decoupling is an architectural organization that may tolerate long memory latencies by executing mem...
Several studies have demonstrated that out-of-order execution processors may not be the most adequat...
This work presents and evaluates a novel processor microarchitecture which combines two paradigms: a...
Decoupled architectures have not traditionally been used in the context of general purpose computing...
124 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1982.When conventional von Neumann...
Providing adequate data bandwidth is extremely important for a wide-issue superscalar processor to a...
The purpose of this paper is to show that using decoupling techniques in a vector processor, the per...
We present a decoupled architecture of processors with a memory hierarchy of only scratch–pad memori...
An architecture for high-performance scalar computation is proposed and discussed. The main feature ...
This paper discusses an approach to reducing memory latency in future systems. It focuses on systems...
Decoupled computer architectures partition the memory access and execute functions in a computer pro...
Summarization: Mapping computational intensive applications on reconfigurable technology for acceler...
It is my great pleasure to serve as guest editor for this special issue of TCCA Newsletter, which is...
In this paper we investigate the behavior of data prefetching on an access decoupled machine and a s...
Several studies have demonstrated that out-of-order execution processors may not be the most adequat...
Decoupling is an architectural organization that may tolerate long memory latencies by executing mem...
Several studies have demonstrated that out-of-order execution processors may not be the most adequat...
This work presents and evaluates a novel processor microarchitecture which combines two paradigms: a...
Decoupled architectures have not traditionally been used in the context of general purpose computing...
124 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1982.When conventional von Neumann...
Providing adequate data bandwidth is extremely important for a wide-issue superscalar processor to a...
The purpose of this paper is to show that using decoupling techniques in a vector processor, the per...