Profiling tools such as gprof and ssrun are used to analyze the run-time performance of a scientific application. The profiling is done in serial and in parallel mode using MPI as the communication interface. The application is a quantum chemistry program using Hartree Fock theory and Pulays DIIS method. An extensive set of test cases is taken into account in order to reach uniform conclusions. A known problem with decreased parallel scalability can thus be narrowed down to a single subroutine responsible for the reduction in Speed Up. The critical module is analyzed and a typical pitfall with triple matrix multiplications is identified. After overhauling the critical subroutine re-examination of the run-time behavior shows significantly im...
Using Amdahl’s law as a metric, the authors illustrate a technique for developing efficient code on ...
In recent years, graphical processing units (GPUs) have generated a lot of excitement in computation...
h i g h l i g h t s • We discuss computational bottlenecks in MD and discuss challenges in paralleli...
Quantum chemistry calculations have a compelling requirement for one-sided communication given their...
MPQC (Massively Parallel Quantum Chemistry) is a widely used computational quantum chemistry code. I...
This study has focused on an application for Quantum Hall Transport simulations and more specificall...
For many years the performance of scientific softwares has been one of the keys to expand the fronti...
The amount of parallelism in modern supercomputers currently grows from generation to generation. Fu...
This thesis, whose topic is quantum chemistry algorithms, is made in the context of the change in pa...
A parallel realization of the NDDO-WF technique for semi-empirical quantum-chemical calculations on ...
Many/multi-core supercomputers provide a natural programming paradigm for hybrid MPI/OpenMP scientif...
Key factors affecting the parallel efficiency of archetypical quantum-chemical calculations are disc...
In this work, we present a parallel approach to complete and restricted active space second-order pe...
GPAW is a versatile software package for first-principles simulations of nanostructures utilizing de...
The DDSCAT software is enabled for use of MPI or OpenMP to distribute calculation of different parti...
Using Amdahl’s law as a metric, the authors illustrate a technique for developing efficient code on ...
In recent years, graphical processing units (GPUs) have generated a lot of excitement in computation...
h i g h l i g h t s • We discuss computational bottlenecks in MD and discuss challenges in paralleli...
Quantum chemistry calculations have a compelling requirement for one-sided communication given their...
MPQC (Massively Parallel Quantum Chemistry) is a widely used computational quantum chemistry code. I...
This study has focused on an application for Quantum Hall Transport simulations and more specificall...
For many years the performance of scientific softwares has been one of the keys to expand the fronti...
The amount of parallelism in modern supercomputers currently grows from generation to generation. Fu...
This thesis, whose topic is quantum chemistry algorithms, is made in the context of the change in pa...
A parallel realization of the NDDO-WF technique for semi-empirical quantum-chemical calculations on ...
Many/multi-core supercomputers provide a natural programming paradigm for hybrid MPI/OpenMP scientif...
Key factors affecting the parallel efficiency of archetypical quantum-chemical calculations are disc...
In this work, we present a parallel approach to complete and restricted active space second-order pe...
GPAW is a versatile software package for first-principles simulations of nanostructures utilizing de...
The DDSCAT software is enabled for use of MPI or OpenMP to distribute calculation of different parti...
Using Amdahl’s law as a metric, the authors illustrate a technique for developing efficient code on ...
In recent years, graphical processing units (GPUs) have generated a lot of excitement in computation...
h i g h l i g h t s • We discuss computational bottlenecks in MD and discuss challenges in paralleli...