We describe the porting of PWscf (Plane-Wave Self Consistent Field), a key component of the Quantum ESPRESSO open-source suite of codes for materials modeling, to GPU systems using CUDA Fortran. Kernel loop directives (CUF kernels) have been extensively used in order to have a single source code for both CPU and GPU implementations. The results of the GPU version have been carefully validated and the performance of the code on several GPU systems (both x86 and POWER8 based) has been compared with traditional Intel multi-core (CPU only) systems. This current GPU version can reduce the time-to-solution by an average factor of 2 12 3 running two different input cases widely used as benchmarks on small and large high performance computing syst...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...
Using two full applications with different characteristics, this thesis explores the performance and...
Product data parallel GPU processor has recently attracted many application developers attention. GP...
We describe the porting of PWscf (Plane-Wave Self Consistent Field), a key component of the Quantum ...
We explore the diagonalization methods used in the PWscf (Plane-Wave Self Consistent Field), a key ...
AbstractThe past decade has produced numerous CPU architectural innovations. These have included mul...
A comparison of PGI OpenACC, FORTRAN CUDA, and Nvidia CUDA pseudospectral methods on a single GPU an...
Scientific computing applications demand ever-increasing performance while traditional microprocesso...
The proliferation of accelerators, in particular GPUs, over the past decade is im- pacting the way s...
Over the past few years, we have seen an exponential performance boost of the graphics processing un...
In this paper we present development work carried out on Quantum ESPRESSO [1] software package withi...
There is a growing need for ever more accurate climate and weather simulations to be delivered in sh...
Quantum ESPRESSO is an integrated suite of open-source computer codes for quantum simulations of ma...
Over the last 20 years, the computing revolution has created many social benefits. The computing ene...
GPUs as general purpose processors already are well adopted in scien-tific and high performance comp...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...
Using two full applications with different characteristics, this thesis explores the performance and...
Product data parallel GPU processor has recently attracted many application developers attention. GP...
We describe the porting of PWscf (Plane-Wave Self Consistent Field), a key component of the Quantum ...
We explore the diagonalization methods used in the PWscf (Plane-Wave Self Consistent Field), a key ...
AbstractThe past decade has produced numerous CPU architectural innovations. These have included mul...
A comparison of PGI OpenACC, FORTRAN CUDA, and Nvidia CUDA pseudospectral methods on a single GPU an...
Scientific computing applications demand ever-increasing performance while traditional microprocesso...
The proliferation of accelerators, in particular GPUs, over the past decade is im- pacting the way s...
Over the past few years, we have seen an exponential performance boost of the graphics processing un...
In this paper we present development work carried out on Quantum ESPRESSO [1] software package withi...
There is a growing need for ever more accurate climate and weather simulations to be delivered in sh...
Quantum ESPRESSO is an integrated suite of open-source computer codes for quantum simulations of ma...
Over the last 20 years, the computing revolution has created many social benefits. The computing ene...
GPUs as general purpose processors already are well adopted in scien-tific and high performance comp...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...
Using two full applications with different characteristics, this thesis explores the performance and...
Product data parallel GPU processor has recently attracted many application developers attention. GP...