In this thesis, I describe the evaluation framework for Rigel, a 1024-core single-chip accelerator architecture designed for high throughput on visual computing and scientific workloads. I present an integrated evaluation framework for investigating co-designed architecture, compilers, programming models, and RTL implementation for massively parallel chip multiprocessors (CMPs). The research objective of the Rigel project, designing a prototype thousand-core chip, led to the development of the framework presented in this thesis. I describe the tools and techniques which enabled this work. The goal of this thesis is not to evaluate specific design tradeoffs, but to describe the tools we have developed for making these decisions....
AbstractAs the Pawsey Centre project continues, in 2013 iVEC was tasked with deciding which accelera...
In order to reach exascale computing capability, accelerators have become a crucial part in developi...
thesisTo address the need of understanding and optimizing the performance of complex applications an...
In this thesis, I describe the evaluation framework for Rigel, a 1024-core single-chip accelerator ...
This dissertation describes work on the architecture of throughput-oriented accelerator processors. ...
Memory model design is a major part of any modern processor architecture. There are many design cho...
There is a large, emerging, and commercially relevant class of applications which stands to be enabl...
The Rigel compute accelerator has been developed to explore alternative architectures for massively ...
Scientific applications rely heavily on floating point data types. Floating point operations are co...
......Increasing demand for perfor-mance on data-intensive parallel workloads has driven the design ...
Parallel programming requires a significant amount of developer effort, and creating optimized paral...
Nowadays many-core computing platforms are widely adopted as a viable solution to accelerate compute...
Accelerators, such as GPUs and Intel Xeon Phis, have become the workhorses of high-performance compu...
A new computer architecture, intended for implementation in late and post silicon technologies, is p...
AbstractPerformance benchmarks should be embedded in comprehensive frameworks that suitably set thei...
AbstractAs the Pawsey Centre project continues, in 2013 iVEC was tasked with deciding which accelera...
In order to reach exascale computing capability, accelerators have become a crucial part in developi...
thesisTo address the need of understanding and optimizing the performance of complex applications an...
In this thesis, I describe the evaluation framework for Rigel, a 1024-core single-chip accelerator ...
This dissertation describes work on the architecture of throughput-oriented accelerator processors. ...
Memory model design is a major part of any modern processor architecture. There are many design cho...
There is a large, emerging, and commercially relevant class of applications which stands to be enabl...
The Rigel compute accelerator has been developed to explore alternative architectures for massively ...
Scientific applications rely heavily on floating point data types. Floating point operations are co...
......Increasing demand for perfor-mance on data-intensive parallel workloads has driven the design ...
Parallel programming requires a significant amount of developer effort, and creating optimized paral...
Nowadays many-core computing platforms are widely adopted as a viable solution to accelerate compute...
Accelerators, such as GPUs and Intel Xeon Phis, have become the workhorses of high-performance compu...
A new computer architecture, intended for implementation in late and post silicon technologies, is p...
AbstractPerformance benchmarks should be embedded in comprehensive frameworks that suitably set thei...
AbstractAs the Pawsey Centre project continues, in 2013 iVEC was tasked with deciding which accelera...
In order to reach exascale computing capability, accelerators have become a crucial part in developi...
thesisTo address the need of understanding and optimizing the performance of complex applications an...