The architectural diversity of many-core processors - with their different types of cores and memory hierarchies - makes the old model of reprogramming every application for every platform infeasible. Inter-platform portability has therefore become a desirable feature of programming models. While functional portability is ensured by standards and compilers (e.g., OpenCL), achieving high performance across platforms remains a much more challenging task. In this thesis, we investigate techniques for enabling and disabling platform-specific optimizations within a unified programming model. We selected OpenCL as our research vehicle and found that each platform has a specific optimization space for a given kernel. Taking two co...
146 p. Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2008. My work discusses various str...
Recent developments in processor architecture have settled a shift from sequential processing to par...
Accelerator processors allow energy-efficient computation at high performance, especially for comput...
This work describes my solution to the performance portability problem: between CPUs and GPUs in par...
Application development for modern high-performance systems with many cores, i.e., comprising multip...
The rising pressure to simultaneously improve performance and reduce power consumption is driving mo...
Recently, multi-core chips have become omnipresent in computer systems ranging from high-end server...
Utilizing heterogeneous platforms for computation has become a general trend, making the portability...
In the last few years, the computing industry has changed its course from ever higher clock speeds t...
Due to the diversity of processor architectures and application memory access patterns, the...
Shared memory multi-core processor technology has seen a drastic development with faster and increasi...
Open Computing Language (OpenCL) is an open standard for writing portable software for heterogeneous...
OpenCL is a standard for parallel programming of heterogeneous systems. The benefits of a common pro...
Programmability, performance portability, and resource efficiency have emerged as critical challenge...