This paper explores the use of a simple linear performance model, that determines execution time based instruction and cache miss counts, for describing the behaviour of two-electron integral evaluation algorithm in the Gaussian computational chemistry package. Four different microarchitecture platforms are considered with a total of seven individual microprocessors. Both Hartree-Fock and hybrid Hartree-Fock/Density Functional Theory electronic structure methods are assessed. In most cases the model is found to be accurate to within 3%. Least agreement is for an Athlon64 system (ranging from 1.8% to 6.5%) and a periodic boundary computation on an Opteron where errors of up to 6.8% are observed. These errors arise as the model does not accou...
The use of global address space languages and one-sided communication for complex applications is ga...
We present a cache performance modeling methodology that facilitates the tuning of uniprocessor cach...
The computational efficiency and energy-to-solution of several applications using the GAMESS quantum...
AbstractThis paper explores the use of a simple linear performance model, that determines execution ...
Many situations call for an estimation of the execution time of applications, e.g., during design or...
We present a general criterion for theoretical performance assessment of algoritlnns for twoelectron...
TECHNIQUES FOR THE EXECUTION PROFILE ANALYSIS AND OPTIMIZATION OF COMPUTATIONAL CHEMISTRY PROGRAMS, ...
The numerical integration of the exchange–correlation (XC) potential is one of the primary computati...
Building fast, reliable, and adaptive software is a constant challenge for computational science, es...
<div><p>We report on scaling and timing tests of the SIESTA electronic structure code for <i>ab init...
Workload characterization has been proven an essential tool to architecture design and performance e...
We report on scaling and timing tests of the SIESTA electronic structure code for ab initio molecula...
In this work, we extend and evaluate a simple performance model to account for NUMA and bandwidth ef...
Abstract: We propose the algorithm to evaluate the Coulomb potential in the ab initio density functi...
FPGA devices used in the HPC context promise an increased energy efficiency, enhancing the computing...
The use of global address space languages and one-sided communication for complex applications is ga...
We present a cache performance modeling methodology that facilitates the tuning of uniprocessor cach...
The computational efficiency and energy-to-solution of several applications using the GAMESS quantum...
AbstractThis paper explores the use of a simple linear performance model, that determines execution ...
Many situations call for an estimation of the execution time of applications, e.g., during design or...
We present a general criterion for theoretical performance assessment of algoritlnns for twoelectron...
TECHNIQUES FOR THE EXECUTION PROFILE ANALYSIS AND OPTIMIZATION OF COMPUTATIONAL CHEMISTRY PROGRAMS, ...
The numerical integration of the exchange–correlation (XC) potential is one of the primary computati...
Building fast, reliable, and adaptive software is a constant challenge for computational science, es...
<div><p>We report on scaling and timing tests of the SIESTA electronic structure code for <i>ab init...
Workload characterization has been proven an essential tool to architecture design and performance e...
We report on scaling and timing tests of the SIESTA electronic structure code for ab initio molecula...
In this work, we extend and evaluate a simple performance model to account for NUMA and bandwidth ef...
Abstract: We propose the algorithm to evaluate the Coulomb potential in the ab initio density functi...
FPGA devices used in the HPC context promise an increased energy efficiency, enhancing the computing...
The use of global address space languages and one-sided communication for complex applications is ga...
We present a cache performance modeling methodology that facilitates the tuning of uniprocessor cach...
The computational efficiency and energy-to-solution of several applications using the GAMESS quantum...