This paper presents a performance-portable implementation of a kinetic plasma simulation code that uses C++ parallel algorithms to run across multiple CPUs and GPUs. Relying on the language-standard parallelism stdpar and the proposed language-standard multi-dimensional array support mdspan, we demonstrate that a performance-portable implementation is possible without harming readability or productivity. For a mini-application, we obtain good overall performance, within 20% of the Kokkos version on Intel Icelake CPUs and NVIDIA V100 and A100 GPUs. We conclude that stdpar is a good candidate for developing performance-portable and productive code targeting Exascale-era platforms, assuming this approach will b...
The simulation of parallel heterogeneous architectures such as multi-cores and GPUs sets new challen...
The alpaka library defines and implements an abstract hierarchical redundant parallelism model. This...
We have developed a new algorithm for implementation of plasma particle-in-cell (PIC) simulation code...
There are multiple applications of plasma. To better understand the fundamental characteristics of p...
Performance portability is considered to be an inevitable requirement in the exascale era. We explore...
WACCPD 2019: International Workshop on Accelerator Programming Using Directives, ISBN 978-3-030-49943-...
As High Energy Physics collider experiments continue to push the boundaries of instantaneous luminos...
With the appearance of the heterogeneous platform OpenPower, many-core accelerator devices have been ...
This paper reports on an in-depth evaluation of the performance portability frameworks Kokkos and RA...
This work describes my solution to the performance portability problem: between CPUs and GPUs in par...
JuSPIC is a particle-in-cell (PIC) code, developed in the Simulation Lab for Plasma Physics of the J...
The article describes various options for speeding up calculations on computer systems. These featur...
This paper describes a parallel implementation of a numerical solver for the V...
A decade after the beginning of the many-core era, multi-core CPU and GPU architectures are everywhe...
Understanding turbulent transport in magnetised plasmas is a subject of major ...