Mathematicians and domain scientists who want to implement their algorithms on a supercomputer typically rely on HPC software frameworks to deliver reasonable performance. While it is important to separate the concerns of applications, algorithm and performance to some extent (especially in an increasingly complex hardware landscape), this often leads to a dramatic performance loss in practice because applications may benefit from tailored operations that are not seen as useful in the general setting of a numerical library. A prominent example is the orthogonalization of "tall and skinny" matrices, which leads to memory-bounded performance, so that LAPACK (optimized for compute-bound situations) does not deliver optimal performa...
The ESSEX project is funded by the German DFG priority programme 1648 Software for Exascale Computin...
The aim of this paper is to show an effective reorganization of the nonsymmetric block lanczos algo...
Sparse linear iterative solvers are essential for many large-scale simulations. Much of the runtime ...
The complexity of the latest HPC architectures increasingly limits the productivity of researchers i...
The increasing complexity of hardware and software environments in high-performance computing poses ...
As modern supercomputers approach the Exascale, many numerical libraries face scalability issues due...
Numerous challenges have to be mastered as applications in scientific computing are being developed ...
This paper presents a new software framework for solving large and sparse linear systems on current ...
This paper presents a new software framework for solving large and sparse linear systems on current ...
International audienceThe ever growing complexity and scale of parallel architectures imposes to rew...
AbstractWe examine the computational efficiency of linear algebra components in iterative solvers fo...
This report has been developed over the work done in the deliverable [Nava94] There it was shown tha...
While many of the architectural details of future exascale-class high performance computer systems ...
Many computationally intensive problems in engineering and science, such as those driven by Partial ...
International audienceWe present a method for automatically selecting optimal implementations of spa...
The ESSEX project is funded by the German DFG priority programme 1648 Software for Exascale Computin...
The aim of this paper is to show an effective reorganization of the nonsymmetric block lanczos algo...
Sparse linear iterative solvers are essential for many large-scale simulations. Much of the runtime ...
The complexity of the latest HPC architectures increasingly limits the productivity of researchers i...
The increasing complexity of hardware and software environments in high-performance computing poses ...
As modern supercomputers approach the Exascale, many numerical libraries face scalability issues due...
Numerous challenges have to be mastered as applications in scientific computing are being developed ...
This paper presents a new software framework for solving large and sparse linear systems on current ...
This paper presents a new software framework for solving large and sparse linear systems on current ...
International audienceThe ever growing complexity and scale of parallel architectures imposes to rew...
AbstractWe examine the computational efficiency of linear algebra components in iterative solvers fo...
This report has been developed over the work done in the deliverable [Nava94] There it was shown tha...
While many of the architectural details of future exascale-class high performance computer systems ...
Many computationally intensive problems in engineering and science, such as those driven by Partial ...
International audienceWe present a method for automatically selecting optimal implementations of spa...
The ESSEX project is funded by the German DFG priority programme 1648 Software for Exascale Computin...
The aim of this paper is to show an effective reorganization of the nonsymmetric block lanczos algo...
Sparse linear iterative solvers are essential for many large-scale simulations. Much of the runtime ...