Functional languages provide a solid foundation on which complex optimization passes can be designed to exploit parallelism available in the underlying system. Their mathematical foundations enable high-level optimizations that would be impossible in traditional im-perative languages. This makes them uniquely suited for generation of efficient target code for parallel systems, such as multiple Central Processing Units (CPUs) or highly data-parallel Graphics Process-ing Units (GPUs). Such systems are becoming the mainstream for scientific and commodity desktop computing. Writing performance portable code for such systems using low-level languages requires significant effort from a human expert. This paper presents NOVA, a functional language...
The diversity of microarchitecture designs in heterogeneous computing systems allows programs to ach...
Future supercomputers will rely on massive on-chip parallelism that requires dramatic changes be mad...
GPUs have been gaining popularity as general purpose parallel processors that deliver a performance ...
Functional languages provide a solid foundation on which complex optimization passes can be designed...
It is well acknowledged that the dominant mechanism for scaling processor performance has become to ...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
Graphics processing units (GPUs) are powerful devices capable of rapid parallel computation. GPU pro...
AbstractGraphics processor units (GPUs) have evolved to handle throughput oriented workloads where a...
Graphics processing units (GPUs) provide a low cost platform for accelerating high performance compu...
The need to speed-up computing has introduced the interest to explore parallelism in algorithms and ...
The relentless demands for improvements in the compute throughput, and energy efficiency have driven...
High-level domain-specific languages for array processing on the GPU are increasingly common, but th...
have emerged as a powerful accelerator for general-purpose computations. GPUs are attached to every ...
This chapter demonstrates how to leverage the Thrust parallel template library to implement high-per...
The aim of this thesis is to research how the functional paradigm can be used for hardware accelerat...
The diversity of microarchitecture designs in heterogeneous computing systems allows programs to ach...
Future supercomputers will rely on massive on-chip parallelism that requires dramatic changes be mad...
GPUs have been gaining popularity as general purpose parallel processors that deliver a performance ...
Functional languages provide a solid foundation on which complex optimization passes can be designed...
It is well acknowledged that the dominant mechanism for scaling processor performance has become to ...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
Graphics processing units (GPUs) are powerful devices capable of rapid parallel computation. GPU pro...
AbstractGraphics processor units (GPUs) have evolved to handle throughput oriented workloads where a...
Graphics processing units (GPUs) provide a low cost platform for accelerating high performance compu...
The need to speed-up computing has introduced the interest to explore parallelism in algorithms and ...
The relentless demands for improvements in the compute throughput, and energy efficiency have driven...
High-level domain-specific languages for array processing on the GPU are increasingly common, but th...
have emerged as a powerful accelerator for general-purpose computations. GPUs are attached to every ...
This chapter demonstrates how to leverage the Thrust parallel template library to implement high-per...
The aim of this thesis is to research how the functional paradigm can be used for hardware accelerat...
The diversity of microarchitecture designs in heterogeneous computing systems allows programs to ach...
Future supercomputers will rely on massive on-chip parallelism that requires dramatic changes be mad...
GPUs have been gaining popularity as general purpose parallel processors that deliver a performance ...