Substantial time is spent on building, optimizing and maintaining large-scale software that is run on supercomputers. However, little has been done to utilize overall resources efficiently when it comes to including expensive human resources. The community is beginning to acknowledge that optimizing the hardware performance such as speed and memory bottlenecks contributes less to the overall productivity than does the development lifecycle of high-performance scientific applications. Researchers are beginning to look at overall scientific workflows for high performance computing. Scientific programming productivity is measured by time and effort required to develop, configure, and maintain a simulation experiment and its constituent parts, ...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
Projecting performance of applications and hardware is important to several market segments—hardware...
During the past decade, accelerators, such as NVIDIA CUDA GPUs and Intel Xeon Phis, have seen an inc...
Substantial time is spent on building, optimizing and maintaining large-scale software that is run o...
Substantial time is spent on building, optimizing and maintaining large-scale software that is run o...
In the high performance computing domain, the speed of execution of a program has typically been the...
OpenACC has been touted as a "high productivity" API designed to make GPGPU programming accessible t...
This thesis deals with how to develop scientific computing software that runs efficiently on multico...
The next-generation of supercomputers will feature a diverse mix of accelerator devices. The increas...
The use of graphical processing units (GPUs) for general purpose calculations has gained a lot of at...
The ability to write programs that execute efficiently on modern parallel computers has not been fu...
Ever increasing demands for computational power are concomitant with rising electrical power needs a...
We have developed a performance bounding methodology that explains the performance of loop-dominated...
In developing High-Performance Computing (HPC) software, time to solution is an important metric. Th...
When writing computer software one is often forced to balance the need for high run-time performance...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
Projecting performance of applications and hardware is important to several market segments—hardware...
During the past decade, accelerators, such as NVIDIA CUDA GPUs and Intel Xeon Phis, have seen an inc...
Substantial time is spent on building, optimizing and maintaining large-scale software that is run o...
Substantial time is spent on building, optimizing and maintaining large-scale software that is run o...
In the high performance computing domain, the speed of execution of a program has typically been the...
OpenACC has been touted as a "high productivity" API designed to make GPGPU programming accessible t...
This thesis deals with how to develop scientific computing software that runs efficiently on multico...
The next-generation of supercomputers will feature a diverse mix of accelerator devices. The increas...
The use of graphical processing units (GPUs) for general purpose calculations has gained a lot of at...
The ability to write programs that execute efficiently on modern parallel computers has not been fu...
Ever increasing demands for computational power are concomitant with rising electrical power needs a...
We have developed a performance bounding methodology that explains the performance of loop-dominated...
In developing High-Performance Computing (HPC) software, time to solution is an important metric. Th...
When writing computer software one is often forced to balance the need for high run-time performance...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
Projecting performance of applications and hardware is important to several market segments—hardware...
During the past decade, accelerators, such as NVIDIA CUDA GPUs and Intel Xeon Phis, have seen an inc...