Abstract. The past decade has produced numerous CPU architectural innovations. These have included multiple cores per CPU, multiple simultaneous threads per core, and, especially with GPUs, highly complex memory hierarchies. As a result, performance portability has become a major challenge to programmers. We identify the SIMD engines in modern CPU and GPU cores as the key to obtaining high performance for scientific application codes. This common element of all present computing devices makes performance portability possible. However, we find that achieving this performance requires us to express the code in terms of intrinsic functions for the SIMD engine instructions, and these functions are different for each device. To assist the program...
With serial, or sequential, computational operations' growth rate slowing over the past few years...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...
Graphics processor units (GPUs) that were originally designed for graphics rendering have emerged as m...
Abstract. An extreme form of pipelining of the Piecewise-Parabolic Method (PPM) gas dynamics code has ...
Hardware trends over the last decade show increasing complexity and heterogeneity in high performanc...
Design optimization relies heavily on time-consuming simulations, especially when using gradient-fre...
We introduce algorithmic advancements designed to expedite simulations in OpenFOAM using GPUs. These...
This paper investigates the development of a molecular dynamics code that is highly portable between...
Recent advances in the technologies incorporated in graphics hardware have enabled general-purpose com...
A new high-performance general-purpose graphics processing unit (GPGPU) computational fluid dynamics...
The relentless demands for improvements in compute throughput and energy efficiency have driven...
In recent years, multi-core processors have come to dominate the field in desktop and high performan...
Graphics Processing Units (GPUs) are now commonplace in computing systems and are the most successf...
Abstract. Graphical Processing Units (GPUs) have shown acceleration factors over multicores for stru...