A high-performance fast multipole method (FMM) is crucial for the numerical simulation of many physical problems. In a previous study, we showed that a task-based FMM provides the flexibility required to process a wide spectrum of particle distributions efficiently on multicore architectures. In this paper, we show how such an approach can be extended to fully exploit heterogeneous platforms. To that end, we design highly tuned graphics processing unit (GPU) versions of the two dominant operators (P2P and M2L), as well as a scheduling strategy that dynamically decides which proportion of subsequent tasks is processed on regular CPU cores and on GPU accelerators. We assess our method with the StarPU runtime s...
Fast Multipole Methods are a fundamental operation for the simulation of many ...
In the field of HPC, the current hardware trend is to design multiprocessor ar...
To fully tap into the potential of heterogeneous machines composed of multicor...
Preliminary version of a paper to appear in SIAM SISC. Fast Multipole Methods (FMM) are a fundamental ...
High performance FMM is crucial for the numerical simulation of many physical problems. In a previo...
Fast Multipole Methods (FMM) are a fundamental operation for the simulation of many physical problem...
GPUs have largely entered HPC clusters, as shown by the top entries of the latest TOP500 issue. Expl...
Multicore machines equipped with accelerators are becoming increasingly popular. The TOP500-leading ...
The Fast Multipole Method (FMM) is considered one of the top ten algorithms...
This thesis deals with heterogeneous architectures in standard workstations. Heterogeneous arch...
The N-body problem appears in many computational physics simulations. At each time step the computat...
The significant increase of the hardware complexity that occurred in the last few years led the high...
Invited Lecture at the SIAM "Encuentro Nacional de Ingeniería Matemática," at Pontificia U...
The Fast Multipole Method allows the rapid evaluation of sums of radial basis functions centered at ...
Learn about the fast multipole method (FMM) and its optimization on NVIDIA GPU...