Abstract. A major challenge in PDE software is balancing user-level flexibility against performance on heterogeneous hardware. We discuss how this challenge can be tackled, using the DUNE framework, and in particular its linear algebra and solver components, as an example. We demonstrate how the former MPI-only implementation is modified to support MPI+[CPU/GPU] threading and vectorisation. To this end, we devise a novel block extension of the recently proposed SELL-C-σ format. The efficiency of our approach is underlined by benchmark computations that exhibit reasonable speedups over the CPU-MPI-only case.
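The abstract names the SELL-C-σ format without describing it. As background only, the following is a minimal sketch of the scalar SELL-C-σ layout as it is commonly described: rows are sorted by descending length within windows of σ rows, grouped into chunks of C rows, padded to the longest row of each chunk, and stored column by column within the chunk so that C consecutive entries can be processed together. This is not the block extension proposed here and not DUNE code; the function names `to_sell_c_sigma` and `spmv`, the dictionary layout, and the example matrix are all assumptions made for illustration.

```python
def to_sell_c_sigma(dense_rows, C, sigma):
    """Build a scalar SELL-C-sigma layout from a list of dense rows (illustrative)."""
    n_rows = len(dense_rows)
    # Nonzero (column, value) pairs of every row.
    nz = [[(j, v) for j, v in enumerate(row) if v != 0] for row in dense_rows]

    # Sort rows by descending length, but only inside windows of sigma rows.
    perm = list(range(n_rows))
    for start in range(0, n_rows, sigma):
        window = perm[start:start + sigma]
        perm[start:start + sigma] = sorted(window, key=lambda r: -len(nz[r]))

    val, col, chunk_ptr, chunk_len = [], [], [0], []
    for start in range(0, n_rows, C):
        rows = perm[start:start + C]
        width = max(len(nz[r]) for r in rows)  # longest row in this chunk
        # Column-major inside the chunk: entry j of all C lanes is contiguous.
        for j in range(width):
            for lane in range(C):
                if lane < len(rows) and j < len(nz[rows[lane]]):
                    c_idx, v = nz[rows[lane]][j]
                else:
                    c_idx, v = 0, 0.0  # zero padding
                col.append(c_idx)
                val.append(v)
        chunk_len.append(width)
        chunk_ptr.append(len(val))
    return {"C": C, "perm": perm, "val": val, "col": col,
            "chunk_ptr": chunk_ptr, "chunk_len": chunk_len}


def spmv(m, x):
    """Compute y = A x from the layout above."""
    C = m["C"]
    y = [0.0] * len(m["perm"])
    for c, width in enumerate(m["chunk_len"]):
        base = m["chunk_ptr"][c]
        rows = m["perm"][c * C:(c + 1) * C]
        # The loop over lanes is the one a SIMD unit or GPU warp would execute in lockstep.
        for lane, r in enumerate(rows):
            y[r] = sum(m["val"][base + j * C + lane] * x[m["col"][base + j * C + lane]]
                       for j in range(width))
    return y


# Hypothetical 5x5 example matrix with row lengths 2, 1, 4, 1, 2.
A = [
    [4.0, 0.0, 0.0, 1.0, 0.0],
    [0.0, 3.0, 0.0, 0.0, 0.0],
    [1.0, 2.0, 5.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 2.0, 0.0],
    [0.0, 1.0, 0.0, 0.0, 6.0],
]
x = [1.0, 2.0, 3.0, 4.0, 5.0]
m = to_sell_c_sigma(A, C=2, sigma=4)
y = spmv(m, x)
print(y)
```

In this toy case the layout stores 14 entries for 10 nonzeros. Larger σ reduces padding by grouping rows of similar length, at the cost of a wider row permutation; larger C matches wider vector units.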
This document gives an introduction to the Distributed and Unified Numerics Environment (DUNE). DUNE...
Abstract—Parallel Discrete Event Simulation (PDES) can substantially improve the performance and cap...
Supercomputing applications rely on strong scaling to achieve faster results on a larger number of p...
We present advances concerning efficient finite element assembly and linear solvers on current and u...
Not only in the field of High-Performance Computing (HPC), Field Programmable Gate Arrays (FPGAs) ar...
In this paper we describe the recent development of the interface library Dune and in particular the...
Applications to solve large and complex partial derivative equation systems of...
The proliferation of accelerators in modern clusters makes efficient coprocessor programming a key r...
The advancement of multicore systems d...
As high-end computing systems continue to grow in scale, recent advances in multi- and many-core arc...
Parallel accelerators are playing an increasingly important role in scientific computing. However, i...
During the last years, there has been a significant increment in the variety of hardware to overcome ...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
MPI is the new de facto standard for multicomputer and cluster message passing. This short paper des...
Presentation given at the 2014 International Workshop on Numerical Methods on High-Performance Co...