We present a hybrid MPI-OpenMP implementation of Linear-Scaling Density Functional Theory within the ONETEP code. We illustrate its performance on a range of high performance computing (HPC) platforms comprising shared-memory nodes with fast interconnect. Our work has focused on applying OpenMP parallelism to the routines which dominate the computational load, attempting where possible to parallelize different loops from those already parallelized within MPI. This includes 3D FFT box operations, sparse matrix algebra operations, calculation of integrals, and Ewald summation. While the underlying numerical methods are unchanged, these developments represent significant changes to the algorithms used within ONETEP to distribute the workload a...
Computer simulations of experimentally comparable system sizes in soft matter often require consider...
AbstractThe capabilities of polymer science and computational chemistry are reaching a point of conv...
This chapter demonstrates how graphics processing units (GPUs) can be used to accelerate large-scale...
We present a hybrid MPI-OpenMP implementation of Linear-Scaling Density Functional Theory within the...
We discuss the computational bottlenecks in molecular dynamics (MD) and describe the challenges in p...
Analytic bond-order potentials (BOPs) allow to obtain a highly accurate description of interatomic i...
© 2013 American Physical SocietyThis article introduces a highly parallel algorithm for molecular dy...
Significant improvements are presented for the molecular dynamics code ls1 mardyn — a linked cell-ba...
The most widely used technique to allow for parallel simulations in molecular dynamics is spatial do...
We show teraflop performance of the fully featured ab initio molecular dynamics code CPMD on an IBM ...
Today's supercomputers often consists of clusters of SMP nodes. Both OpenMP and MPI are programming ...
The increasing number of processing elements and decreasing memory to core ratio in modern high-perf...
We present ONETEP (order-N electronic total energy package), a density functional program for parall...
The Extended Discrete Element Method (XDEM) is a novel and innovative numerical simulation technique...
h i g h l i g h t s • We discuss computational bottlenecks in MD and discuss challenges in paralleli...
Computer simulations of experimentally comparable system sizes in soft matter often require consider...
AbstractThe capabilities of polymer science and computational chemistry are reaching a point of conv...
This chapter demonstrates how graphics processing units (GPUs) can be used to accelerate large-scale...
We present a hybrid MPI-OpenMP implementation of Linear-Scaling Density Functional Theory within the...
We discuss the computational bottlenecks in molecular dynamics (MD) and describe the challenges in p...
Analytic bond-order potentials (BOPs) allow to obtain a highly accurate description of interatomic i...
© 2013 American Physical SocietyThis article introduces a highly parallel algorithm for molecular dy...
Significant improvements are presented for the molecular dynamics code ls1 mardyn — a linked cell-ba...
The most widely used technique to allow for parallel simulations in molecular dynamics is spatial do...
We show teraflop performance of the fully featured ab initio molecular dynamics code CPMD on an IBM ...
Today's supercomputers often consists of clusters of SMP nodes. Both OpenMP and MPI are programming ...
The increasing number of processing elements and decreasing memory to core ratio in modern high-perf...
We present ONETEP (order-N electronic total energy package), a density functional program for parall...
The Extended Discrete Element Method (XDEM) is a novel and innovative numerical simulation technique...
h i g h l i g h t s • We discuss computational bottlenecks in MD and discuss challenges in paralleli...
Computer simulations of experimentally comparable system sizes in soft matter often require consider...
AbstractThe capabilities of polymer science and computational chemistry are reaching a point of conv...
This chapter demonstrates how graphics processing units (GPUs) can be used to accelerate large-scale...