With the appearance of the heterogeneous platform OpenPower, many-core accelerator devices have been coupled with Power host processors for the first time. To utilize their full potential, it is worth investigating performance-portable algorithms that allow choosing the best-fitting hardware for each domain-specific compute task. Suited even to the high level of parallelism on modern GPGPUs, our approach relies heavily on abstract meta-programming techniques, which are essential for focusing on fine-grained tuning rather than code porting. With this in mind, the CUDA-based open-source plasma simulation code PIConGPU is currently being abstracted to support the heterogeneous OpenPower platform using our fast porting interface cupl...
We present a portable platform, called PIC_ENGINE, for accelerating Particle-In-Cell (PIC) codes on ...
Heterogeneous architectures are increasingly common in modern High-Performance Computing (HPC) syste...
Using two full applications with different characteristics, this thesis explores the performance and...
PIConGPU is an open source, multi-platform particle-in-cell code scaling to the fastest supercompute...
This is the archive containing the software used for evaluations in the publication "Performance-Por...
WACCPD 2019: International Workshop on Accelerator Programming Using Directives. ISBN 978-3-030-49943-...
Performance portability is considered to be an inevitable requirement in the exascale era. We explore...
The alpaka library defines and implements an abstract hierarchical redundant parallelism model. This...
PIConGPU is a fully open, community-driven, 3D and 2D3V particle-in-cell code for the age of heterog...
Accelerated computing is becoming more diverse as new vendors and architectures come into play. Alth...
Emerging massively parallel architectures such as a general-purpose processor plus many-core p...
JuSPIC is a particle-in-cell (PIC) code, developed in the Simulation Lab for Plasma Physics of the J...