This report examines ultra-fine grain machine parallelism determined by various hardware styles and constraints. Two major components are incorporated in our system: (1) A generalized parameterized architecture model which characterizes different design styles and constraints based on parallel pipelined machines. (2) A retargetable compiler which maps instruction parallelism to ultra-fine grain machine parallelism for target architectures. Basically the generalized parameterized model is used to specify different target machines, and the retargetable compiler compiles and schedules applications, codes written in high-level language, into control codes for given target machines. The resulting control codes are run through a simulator, after ...
A coarse-grain parallel program typically has one thread (task) per processor, whereas a fine-grain ...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
With the present availability of parallel processors of vastly different architectures, there is a n...
This report examines ultra-fine grain machine parallelism determined by various hardware styles and ...
This thesis investigates parallelism and hardware design trade-offs of parallel and pipelined archit...
The increasing density of VLSI circuits has motivated research into ways to utilize large area budge...
With the rise of chip-multiprocessors, the problem of parallelizing general-purpose programs has onc...
The end of Dennard scaling also brought an end to frequency scaling as a means to improve performanc...
Power consumption and fabrication limitations are increasingly playing significant roles in the desi...
While the chip multiprocessor (CMP) has quickly become the predominant processor architecture, its c...
We describe an approach to parallel compilation that seeks to harness the vast amount of fine-grain ...
Irregular applications have frequent data-dependent memory accesses and control flow. They arise in ...
In this paper we analyze the effect of compiler optimizations on fine grain parallelism in scalar pr...
. Research into automatic extraction of instruction-level parallelism and data parallelism from sequ...
Graduation date: 1995There seems to be a consensus that future Massively Parallel Architectures\ud w...
A coarse-grain parallel program typically has one thread (task) per processor, whereas a fine-grain ...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
With the present availability of parallel processors of vastly different architectures, there is a n...
This report examines ultra-fine grain machine parallelism determined by various hardware styles and ...
This thesis investigates parallelism and hardware design trade-offs of parallel and pipelined archit...
The increasing density of VLSI circuits has motivated research into ways to utilize large area budge...
With the rise of chip-multiprocessors, the problem of parallelizing general-purpose programs has onc...
The end of Dennard scaling also brought an end to frequency scaling as a means to improve performanc...
Power consumption and fabrication limitations are increasingly playing significant roles in the desi...
While the chip multiprocessor (CMP) has quickly become the predominant processor architecture, its c...
We describe an approach to parallel compilation that seeks to harness the vast amount of fine-grain ...
Irregular applications have frequent data-dependent memory accesses and control flow. They arise in ...
In this paper we analyze the effect of compiler optimizations on fine grain parallelism in scalar pr...
. Research into automatic extraction of instruction-level parallelism and data parallelism from sequ...
Graduation date: 1995There seems to be a consensus that future Massively Parallel Architectures\ud w...
A coarse-grain parallel program typically has one thread (task) per processor, whereas a fine-grain ...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
With the present availability of parallel processors of vastly different architectures, there is a n...