Fast Multipole Methods (FMM) are a fundamental operation for the simulation of many physical problems. The high performance design of such methods usually requires to carefully tune the algorithm for both the targeted physics and the hardware. In this paper, we propose a new approach that achieves high performance across architectures. Our method consists of expressing the FMM algorithm as a task flow and employing a state-of-the-art runtime system, StarPU, in order to process the tasks on the different processing units. We carefully design the task flow, the mathematical operators, their Central Processing Unit (CPU) and Graphics Processing Unit (GPU) implementations, as well as scheduling schemes. We compute potentials and forces of 200 m...
International audienceThis report presents a study of techniques used to speedup a scientific simula...
International audienceFast multipole methods (FMM) usually require a careful tuning of the algorithm...
In recent years, the research focus has moved from core microarchitecture to uncore microarchitectur...
Preliminary version of a paper to appear in SIAM SISCFast Multipole Methods (FMM) are a fundamental ...
High performance \FMM is crucial for the numerical simulation of many physical problems. In a previo...
The significant increase of the hardware complexity that occurred in the last few years led the high...
Most high-performance, scientific libraries have adopted hybrid parallelization schemes - such as t...
Whereas most today parallel High Performance Computing (HPC) software is written as highly tuned cod...
NUMA nodes are potentially powerful but taking benefit of their capabilities is challenging due to t...
Task-based systems have gained popularity because of their promise of exploiting the computational p...
Preliminary version of a paper published in SIAM SISC 2014The design of scalable parallel simulation...
The emergence of accelerators as standard computing resources on supercomputers and the subsequent a...
National audienceThe race for ever more computing power raises the issue of supercomputers' power co...
In this thesis, we are interested in solving the Vlasov–Poisson system of equations (useful in the d...
Next generation high performance computers will massively use virtualization as a way to share hardw...
International audienceThis report presents a study of techniques used to speedup a scientific simula...
International audienceFast multipole methods (FMM) usually require a careful tuning of the algorithm...
In recent years, the research focus has moved from core microarchitecture to uncore microarchitectur...
Preliminary version of a paper to appear in SIAM SISCFast Multipole Methods (FMM) are a fundamental ...
High performance \FMM is crucial for the numerical simulation of many physical problems. In a previo...
The significant increase of the hardware complexity that occurred in the last few years led the high...
Most high-performance, scientific libraries have adopted hybrid parallelization schemes - such as t...
Whereas most today parallel High Performance Computing (HPC) software is written as highly tuned cod...
NUMA nodes are potentially powerful but taking benefit of their capabilities is challenging due to t...
Task-based systems have gained popularity because of their promise of exploiting the computational p...
Preliminary version of a paper published in SIAM SISC 2014The design of scalable parallel simulation...
The emergence of accelerators as standard computing resources on supercomputers and the subsequent a...
National audienceThe race for ever more computing power raises the issue of supercomputers' power co...
In this thesis, we are interested in solving the Vlasov–Poisson system of equations (useful in the d...
Next generation high performance computers will massively use virtualization as a way to share hardw...
International audienceThis report presents a study of techniques used to speedup a scientific simula...
International audienceFast multipole methods (FMM) usually require a careful tuning of the algorithm...
In recent years, the research focus has moved from core microarchitecture to uncore microarchitectur...