Computing systems have become increasingly complex with the emergence of heterogeneous hardware combining multicore CPUs and GPUs. These parallel systems exhibit tremendous computa-tional power at the cost of increased programming effort. This re-sults in a tension between achieving performance and code porta-bility. Code is either tuned using device-specific optimizations to achieve maximum performance or is written in a high-level lan-guage to achieve portability at the expense of performance. We propose a novel approach that offers high-level program-ming, code portability and high-performance. It is based on algo-rithmic pattern composition coupled with a powerful, yet simple, set of rewrite rules. This enables systematic transformation...
General-purpose GPU-based systems are highly attractive, as they give potentially massive performanc...
Recent developments in processor architecture have settled a shift from sequential processing to par...
Shared memory multi-core processor technology has seen a drastic developmentwith faster and increasi...
Computers have become increasingly complex with the emergence of heterogeneous hardware combining mu...
Computers have become increasingly complex with the emergence of heterogeneous hardware combining mu...
Application development for modern high-performance systems with many cores, i.e., comprising multip...
The rising pressure to simultaneously improve performance and reduce power consumption is driving mo...
This work describes my solution to the performance portability problem: between CPUs and GPUs in par...
General purpose GPU based systems are highly attractive as they give potentially massive performance...
The problem of automatically generating hardware modules from high level application representations...
Multicore heterogeneous architectures are spreading from desktop computers to mobile and em-bedded s...
Graphics Processing Units (GPUs) are now commonplace in computing systems and are the most successf...
OpenCL is a programming language standard which enables the programmer to express the application by...
International audienceManycore architectures are now available in a wide range of HPC systems. Going...
The relentless demands for improvements in the compute throughput, and energy efficiency have driven...
General-purpose GPU-based systems are highly attractive, as they give potentially massive performanc...
Recent developments in processor architecture have settled a shift from sequential processing to par...
Shared memory multi-core processor technology has seen a drastic developmentwith faster and increasi...
Computers have become increasingly complex with the emergence of heterogeneous hardware combining mu...
Computers have become increasingly complex with the emergence of heterogeneous hardware combining mu...
Application development for modern high-performance systems with many cores, i.e., comprising multip...
The rising pressure to simultaneously improve performance and reduce power consumption is driving mo...
This work describes my solution to the performance portability problem: between CPUs and GPUs in par...
General purpose GPU based systems are highly attractive as they give potentially massive performance...
The problem of automatically generating hardware modules from high level application representations...
Multicore heterogeneous architectures are spreading from desktop computers to mobile and em-bedded s...
Graphics Processing Units (GPUs) are now commonplace in computing systems and are the most successf...
OpenCL is a programming language standard which enables the programmer to express the application by...
International audienceManycore architectures are now available in a wide range of HPC systems. Going...
The relentless demands for improvements in the compute throughput, and energy efficiency have driven...
General-purpose GPU-based systems are highly attractive, as they give potentially massive performanc...
Recent developments in processor architecture have settled a shift from sequential processing to par...
Shared memory multi-core processor technology has seen a drastic developmentwith faster and increasi...