A locality-based threading algorithm for the configuration-interaction method

Shan, H
Williams, S
Johnson, C
McElvain, K

Publication date

June 2017

Publisher

eScholarship, University of California

Abstract

The Configuration Interaction (CI) method has been widely used to solve the non-relativistic many-body Schrodinger equation. One great challenge to implementing it efficiently on manycore architectures is its immense memory and data movement requirements. To address this issue, within each node, we exploit a hybrid MPI+OpenMP programming model in lieu of the traditional flat MPI programming model. In this paper, we develop optimizations that partition the workloads among OpenMP threads based on data locality,-which is essential in ensuring applications with complex data access patterns scale well on manycore architectures. The new algorithm scales to 256 threadson the 64-core Intel Knights Landing (KNL) manycore processor and 24 threads on ...

Extracted data

We use cookies to provide a better user experience.

Data Protection

A locality-based threading algorithm for the configuration-interaction method

Abstract

Extracted data

A locality-based threading algorithm for the configuration-interaction method

Abstract

Extracted data

Related items

Related items