The end of Dennard scaling and the slowdown of Moore's law led to a shift in technology trends toward parallel architectures, particularly in HPC systems. To continue providing performance benefits, HPC should embrace Approximate Computing (AC), which trades application quality loss for improved performance. However, existing AC techniques have not been extensively applied and evaluated in state-of-the-art hardware architectures such as GPUs, the primary execution vehicle for HPC applications today. This paper presents HPAC-Offload, a pragma-based programming model that extends OpenMP offload applications to support AC techniques, allowing portable approximations across different GPU architectures. We conduct a comprehensive performance a...
Performance modeling, the science of understanding and predicting application performance, is import...
Over the past years, GPUs became ubiquitous in HPC installations around the world. Today, they provi...
Graphic processors are becoming faster and faster. Computational power within graphic processing uni...
This is an implementation of the HPAC-Offload programming model that equips OpenMP offload applicati...
Graphics Processing Units (GPU) have been widely adopted to accelerate the execution of HPC workload...
Cavazos, JohnAs the high-performance computing (HPC) community continues the push towards exascale ...
With processor clock speeds having stagnated, parallel computing architectures have achieved a break...
In High-Performance Computing (HPC), GPU-based accelerators are pervasive for two reasons: first, GP...
This report highlights our work on improving GPU parallelization by supporting compute nodes with mu...
This paper presents HALO 1.0, an open-ended extensible multi-agent software framework that implement...
The relentless demands for improvements in the compute throughput, and energy efficiency have driven...
While architectures and programming models have remained relatively stable for almost two decades, n...
HPC as a service (HPCaaS) is a new way to expose HPC resources via cloud services. However, continue...
The field of high-performance computing (HPC) has always dealt with the bleeding edge of computation...
Just five years ago, NVIDIA introduced CUDA, the Compute Unified Device Architecture, which signifi...
Performance modeling, the science of understanding and predicting application performance, is import...
Over the past years, GPUs became ubiquitous in HPC installations around the world. Today, they provi...
Graphic processors are becoming faster and faster. Computational power within graphic processing uni...
This is an implementation of the HPAC-Offload programming model that equips OpenMP offload applicati...
Graphics Processing Units (GPU) have been widely adopted to accelerate the execution of HPC workload...
Cavazos, JohnAs the high-performance computing (HPC) community continues the push towards exascale ...
With processor clock speeds having stagnated, parallel computing architectures have achieved a break...
In High-Performance Computing (HPC), GPU-based accelerators are pervasive for two reasons: first, GP...
This report highlights our work on improving GPU parallelization by supporting compute nodes with mu...
This paper presents HALO 1.0, an open-ended extensible multi-agent software framework that implement...
The relentless demands for improvements in the compute throughput, and energy efficiency have driven...
While architectures and programming models have remained relatively stable for almost two decades, n...
HPC as a service (HPCaaS) is a new way to expose HPC resources via cloud services. However, continue...
The field of high-performance computing (HPC) has always dealt with the bleeding edge of computation...
Just five years ago, NVIDIA introduced CUDA, the Compute Unified Device Architecture, which signifi...
Performance modeling, the science of understanding and predicting application performance, is import...
Over the past years, GPUs became ubiquitous in HPC installations around the world. Today, they provi...
Graphic processors are becoming faster and faster. Computational power within graphic processing uni...