Following the trend towards Exascale, today’s supercomputers consist of increasingly complex and heterogeneous compute nodes. To exploit the performance of these systems, research software in HPC needs to keep up with the rapid development of hardware architectures. Since manual tuning of software to each and every architecture is neither sustainable nor viable, we aim to tackle this challenge through appropriate software design. In this article, we aim to improve the performance and sustainability of FMSolvr, a parallel Fast Multipole Method for Molecular Dynamics, by adapting it to Non-Uniform Memory Access architectures in a portable and maintainable way. The parallelization of FMSolvr is based on Eventify, an event-based tasking framewo...
A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems Rio Yokota...
Fast summation methods like the FMM are the backbone of a multitude of simulations in MD, astrophysi...
International audienceNon Uniform Memory Access (NUMA) architectures are nowadays common for running...
Molecular dynamics (MD) has become a vital research method in biochemistry and materials science. GR...
Among the algorithms that are likely to play a major role in future exascale computing, the fast mul...
Embedded manycore architectures are often organized as fabrics of tightly-coupled shared memory clus...
Modern computing platforms are based on multi-processor/multi-core technology. This allows running a...
International audienceNowadays, on Multi-core Multiprocessors with Hierarchical Memory (Non-Uniform ...
It is well known that Time Warp may suffer from large usage of memory, which may hamper the efficien...
Multi-core platforms with non-uniform memory access (NUMA) design are now a common resource in High ...
Processors with multiple sockets or chiplets are becoming more conventional. These kinds of processo...
Multi-core nodes with Non-Uniform Memory Access (NUMA) are now a common architecture for high perfor...
Non-uniform memory access (NUMA) architectures are modern shared-memory, multi-core machines offerin...
Current multi-socket systems have complex memory hierarchies with significant Non-Uniform Memory Acc...
A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems Rio Yokota...
Fast summation methods like the FMM are the backbone of a multitude of simulations in MD, astrophysi...
International audienceNon Uniform Memory Access (NUMA) architectures are nowadays common for running...
Molecular dynamics (MD) has become a vital research method in biochemistry and materials science. GR...
Among the algorithms that are likely to play a major role in future exascale computing, the fast mul...
Embedded manycore architectures are often organized as fabrics of tightly-coupled shared memory clus...
Modern computing platforms are based on multi-processor/multi-core technology. This allows running a...
International audienceNowadays, on Multi-core Multiprocessors with Hierarchical Memory (Non-Uniform ...
It is well known that Time Warp may suffer from large usage of memory, which may hamper the efficien...
Multi-core platforms with non-uniform memory access (NUMA) design are now a common resource in High ...
Processors with multiple sockets or chiplets are becoming more conventional. These kinds of processo...
Multi-core nodes with Non-Uniform Memory Access (NUMA) are now a common architecture for high perfor...
Non-uniform memory access (NUMA) architectures are modern shared-memory, multi-core machines offerin...
Current multi-socket systems have complex memory hierarchies with significant Non-Uniform Memory Acc...
A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems Rio Yokota...
Fast summation methods like the FMM are the backbone of a multitude of simulations in MD, astrophysi...
International audienceNon Uniform Memory Access (NUMA) architectures are nowadays common for running...