The complexity of the latest HPC architectures increasingly limits the productivity of researchers in numerical algorithms and lengthens the 'time to market' for parallel algorithms. Implementing a new method on a supercomputer today involves at least three levels of parallelism and typically several programming models such as MPI, OpenMP and CUDA. Frameworks like Trilinos and PETSc have for many years been useful for testing new ideas in parallel algorithms, but on CPU/GPU clusters, for example, they have so far failed to deliver convincing performance. We look at sparse solvers from a software engineer's point of view and advocate a programming model we call 'SPMD+OK', introducing performance models in the test-driven development proce...
The increasing number of processing elements and decreasing memory to core ratio in moder...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
The ESSEX project is funded by the German DFG priority programme 1648 Software for Exascale Computin...
Mathematicians and domain scientists who want to implement their algorithms on a supercomputer typic...
Numerous challenges have to be mastered as applications in scientific computing are being developed ...
The increasing complexity of hardware and software environments in high-performance computing poses ...
As modern supercomputers approach the Exascale, many numerical libraries face scalability issues due...
Iterative solvers for eigenvalue problems are often the only means of computing the extremal eigenva...
The trend towards highly parallel multi-processing is ubiquitous in all modern computer architecture...
This dissertation studies the sources of poor performance in scientific computing codes based on par...
The Petascale Computing Enabling Technologies (PCET) project addressed challenges arising from curre...
The increasing number of processing elements and decreasing memory to core ratio in modern high-perf...
Simulations on HPC systems have become an indispensable key technology in modern science and enginee...
For the analysis and solution of discretized ordinary or partial differential equations it is necess...
Clusters of workstations have become a cost-effective means of performing scientific computations. H...