This article describes how we manage to increase performance and to extend features of a large parallel application through the use of simultaneous multithreading (SMT) and by designing a robust parallel transpose algorithm. The semi-Lagrangian code Gysela typically performs large physics simulations using a few thousands of cores, between 1k cores up to 16k on x86-based clusters. However, simulations with finer resolutions and with kinetic electrons increase those needs by a huge factor, providing a good example of applications requiring Exascale machines. To improve Gysela compute times, we take advantage of efficient SMT implementations available on recent INTEL architectures. We also analyze the cost of a transposition communication sch...
One of the important phenomena in magnetically-confined fusion plasma is plasma turbulence, which ca...
Nowadays, the most powerful supercomputers in the world, needed for solving complex models and simu...
International audienceModeling turbulent transport is a major goal in order to predict confinement p...
This article describes how we manage to increase performance and to extend features of a large paral...
International audienceGyrokinetic simulations lead to huge computational needs. Up to now, the Semi-...
International audienceGyrokinetic simulations lead to huge computational needs. Up to now, the semi-...
Gyrokinetic simulations lead to huge computational needs. Up to now, the semi- Lagrangian co...
Communication and computation overlapping techniques have been introduced in the five‐dimensional gy...
Gyrokinetic simulations lead to huge computational needs. Up to now, the semi- Lagrangian co...
Modeling turbulent transport is a major goal in order to predict confinement issues in a tokamak pla...
International audienceThe current generation of the Xeon Phi Knights Landing (KNL) processor provide...
A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems Rio Yokota...
The DDSCAT software is enabled for use of MPI or OpenMP to distribute calculation of different parti...
International audienceThis work describes the challenges presented by porting parts of the gysela co...
The research presented in this thesis investigates parallel implementations of the Fast Sweeping Met...
One of the important phenomena in magnetically-confined fusion plasma is plasma turbulence, which ca...
Nowadays, the most powerful supercomputers in the world, needed for solving complex models and simu...
International audienceModeling turbulent transport is a major goal in order to predict confinement p...
This article describes how we manage to increase performance and to extend features of a large paral...
International audienceGyrokinetic simulations lead to huge computational needs. Up to now, the Semi-...
International audienceGyrokinetic simulations lead to huge computational needs. Up to now, the semi-...
Gyrokinetic simulations lead to huge computational needs. Up to now, the semi- Lagrangian co...
Communication and computation overlapping techniques have been introduced in the five‐dimensional gy...
Gyrokinetic simulations lead to huge computational needs. Up to now, the semi- Lagrangian co...
Modeling turbulent transport is a major goal in order to predict confinement issues in a tokamak pla...
International audienceThe current generation of the Xeon Phi Knights Landing (KNL) processor provide...
A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems Rio Yokota...
The DDSCAT software is enabled for use of MPI or OpenMP to distribute calculation of different parti...
International audienceThis work describes the challenges presented by porting parts of the gysela co...
The research presented in this thesis investigates parallel implementations of the Fast Sweeping Met...
One of the important phenomena in magnetically-confined fusion plasma is plasma turbulence, which ca...
Nowadays, the most powerful supercomputers in the world, needed for solving complex models and simu...
International audienceModeling turbulent transport is a major goal in order to predict confinement p...