The diversity of microarchitecture designs in heterogeneous computing systems allows programs to achieve high performance and energy efficiency, but results in substantial software redevelopment cost for each type or generation of hardware. To mitigate this cost, a performance portable programming system is required. This work presents my solution to the performance portability problem. I argue that a new language is required for replacing the current practices of programming systems to achieve practical performance portability. To support my argument, I first demonstrate the limited performance portability of the current practices by showing quantitative and qualitative evidences. I identify the main limiting issues of conventional pro...
The rising pressure to simultaneously improve performance and reduce power consumption is driving mo...
Graphics Processing Units (GPUs) are now commonplace in computing systems and are the most successf...
Functional languages provide a solid foundation on which complex optimization passes can be designed...
This work describes my solution to the performance portability problem: between CPUs and GPUs in par...
Because of tight power and energy constraints, industry is progressively shifting toward heterogeneo...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
This thesis addresses issues associated with efficiently programming modern heterogeneous GPU-based ...
It has been widely shown that GPGPU architectures offer large performance gains compared to their tr...
The relentless demands for improvements in the compute throughput, and energy efficiency have driven...
Abstract—User-level components of applications can be made performance-aware by annotating them with...
It is well acknowledged that the dominant mechanism for scaling processor performance has become to ...
It has been widely shown that GPGPU architectures offer large performance gains compared to their tr...
The need to speed-up computing has introduced the interest to explore parallelism in algorithms and ...
Programmability, performance portability, and resource efficiency have emerged as critical challenge...
Computer systems have become more heterogeneous due to the breakdown of Dennard Scaling and the rapi...
The rising pressure to simultaneously improve performance and reduce power consumption is driving mo...
Graphics Processing Units (GPUs) are now commonplace in computing systems and are the most successf...
Functional languages provide a solid foundation on which complex optimization passes can be designed...
This work describes my solution to the performance portability problem: between CPUs and GPUs in par...
Because of tight power and energy constraints, industry is progressively shifting toward heterogeneo...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
This thesis addresses issues associated with efficiently programming modern heterogeneous GPU-based ...
It has been widely shown that GPGPU architectures offer large performance gains compared to their tr...
The relentless demands for improvements in the compute throughput, and energy efficiency have driven...
Abstract—User-level components of applications can be made performance-aware by annotating them with...
It is well acknowledged that the dominant mechanism for scaling processor performance has become to ...
It has been widely shown that GPGPU architectures offer large performance gains compared to their tr...
The need to speed-up computing has introduced the interest to explore parallelism in algorithms and ...
Programmability, performance portability, and resource efficiency have emerged as critical challenge...
Computer systems have become more heterogeneous due to the breakdown of Dennard Scaling and the rapi...
The rising pressure to simultaneously improve performance and reduce power consumption is driving mo...
Graphics Processing Units (GPUs) are now commonplace in computing systems and are the most successf...
Functional languages provide a solid foundation on which complex optimization passes can be designed...