The numerical integration of the exchange–correlation (XC) potential is one of the primary computational bottlenecks in Gaussian basis set Kohn–Sham density functional theory (KS-DFT). To achieve optimal performance and accuracy, care must be taken in this numerical integration to preserve local sparsity as to allow for near linear weak scaling with system size. This leads to an integration scheme with several performance critical kernels which must be hand optimized for each architecture of interest. As the set of available accelerator hardware goes more diverse, a key challenge for developers of KS-DFT software is to maintain performance portability across a wide range of computational architectures. In this work, we examine a modular sof...
This chapter describes the graphics processing unit (GPU) acceleration of density functional theory ...
We describe an extension of our graphics processing unit (GPU) electronic structure program TeraChem...
AbstractThis paper explores the use of a simple linear performance model, that determines execution ...
The numerical integration of the exchange–correlation (XC) potential is one of the primary computati...
The numerical integration of the exchange–correlation (XC) potential is one of the primary computati...
The predominance of Kohn-Sham density functional theory (KS-DFT) for the theoretical treatment of la...
The predominance of Kohn-Sham density functional theory (KS-DFT) for the theoretical treatment of la...
The predominance of Kohn-Sham density functional theory (KS-DFT) for the theoretical treatment of la...
Materials research is an area that is expected to strongly benefit from the growing performance capa...
Paper presented at CUG 2010, EdinburghCP2K is a freely available and increasingly popular Density Fu...
Addressing certain materials science problems, e.g. inhomogeneous materials, using Density Functiona...
GPAW is a versatile software package for first-principles simulations of nanostructures utilizing de...
This paper presents a Field Programmable Gate Array (FPGA) implementation of a calculation module fo...
This paper presents a Field Programmable Gate Array (FPGA) implementation of a calculation module fo...
GPAW is a versatile software package for first-principles simulations of nanostructures utilizing de...
This chapter describes the graphics processing unit (GPU) acceleration of density functional theory ...
We describe an extension of our graphics processing unit (GPU) electronic structure program TeraChem...
AbstractThis paper explores the use of a simple linear performance model, that determines execution ...
The numerical integration of the exchange–correlation (XC) potential is one of the primary computati...
The numerical integration of the exchange–correlation (XC) potential is one of the primary computati...
The predominance of Kohn-Sham density functional theory (KS-DFT) for the theoretical treatment of la...
The predominance of Kohn-Sham density functional theory (KS-DFT) for the theoretical treatment of la...
The predominance of Kohn-Sham density functional theory (KS-DFT) for the theoretical treatment of la...
Materials research is an area that is expected to strongly benefit from the growing performance capa...
Paper presented at CUG 2010, EdinburghCP2K is a freely available and increasingly popular Density Fu...
Addressing certain materials science problems, e.g. inhomogeneous materials, using Density Functiona...
GPAW is a versatile software package for first-principles simulations of nanostructures utilizing de...
This paper presents a Field Programmable Gate Array (FPGA) implementation of a calculation module fo...
This paper presents a Field Programmable Gate Array (FPGA) implementation of a calculation module fo...
GPAW is a versatile software package for first-principles simulations of nanostructures utilizing de...
This chapter describes the graphics processing unit (GPU) acceleration of density functional theory ...
We describe an extension of our graphics processing unit (GPU) electronic structure program TeraChem...
AbstractThis paper explores the use of a simple linear performance model, that determines execution ...