Abstract. The past decade has produced numerous CPU architectural innovations. These have included multiple cores per CPU, multiple simultaneous threads per core, and, especially with GPUs, highly complex memory hierarchies. As a result, performance portability has become a major challenge for programmers. We identify the SIMD engines in modern CPU and GPU cores as the key to obtaining high performance for scientific application codes. This common element of all present computing devices makes performance portability possible. However, we find that achieving this performance requires us to express the code in terms of intrinsic functions for the SIMD engine instructions, and these functions are different for each device. To assist the program...
Graphics processing units (GPUs) provide a low cost platform for accelerating high performance compu...
Abstract. Graphical Processing Units (GPUs) have shown acceleration factors over multicores for stru...
The proliferation of accelerators, in particular GPUs, over the past decade is impacting the way s...
Abstract. An extreme form of pipelining of the Piecewise-Parabolic Method (PPM) gas dynamics code has ...
The relentless demands for improvements in compute throughput and energy efficiency have driven...
Design optimization relies heavily on time-consuming simulations, especially when using gradient-fre...
High Performance Computing (HPC) platforms allow scientists to model computationally intensive algor...
This paper investigates the development of a molecular dynamics code that is highly portable between...
Key words: graphics processing units, unstructured finite volume method, computational fluid dynam...
Hardware trends over the last decade show increasing complexity and heterogeneity in high performanc...
We show how compiler technology can generate fast and efficient yet human-readable data-parallel sim...
Real-time simulations of smoke propagation during fires in complex geometries challenge engineers, p...
Accelerators are quickly emerging as the leading technology to further boost computing performances;...
Accelerator devices like the General Purpose Graphics Computing Units (GPGPUs) play an important rol...