Abstract. In hardware-aware high performance computing, block- asynchronous iteration and mixed precision iterative refinement are two techniques that are applied to leverage the computing power of SIMD accelerators like GPUs. Al-though they use a very different approach for this purpose, they share the basic idea of compensating the convergence behaviour of an inferior numerical al-gorithm by a more efficient usage of the provided computing power. In this paper, we want to analyze the potential of combining both techniques. There-fore, we implement a mixed precision iterative refinement algorithm using a block-asynchronous iteration as an error correction solver, and compare its performance with a pure implementation of a block-asynchronou...
International audienceAsynchronous iterations can be used to implement fixed-point methods such as J...
We propose a general algorithm for solving a $n\times n$ nonsingular linear system $Ax = b$ based on...
Abstract. FPGAs and GPUs are increasingly used in a range of high performance computing applications...
In hardware-aware high performance computing, block-asynchronous iteration and mixed precision itera...
We present several algorithms to compute the solution of a linear system of equations on a graphics ...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
AbstractThis paper explores the need for asynchronous iteration algorithms as smoothers in multigrid...
Low-precision floating-point arithmetic is a powerful tool for accelerating scientific computing app...
Abstract This paper explores the need for asynchronous iteration algorithms as smoothers in multigri...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
This paper explores the need for asynchronous iteration algorithms as smoothers in multigrid methods...
By using a combination of 32-bit and 64-bit floating point arithmetic, the per-formance of many dens...
In this survey paper, we compare native double precision solvers with emulated- and mixed- precision...
In this paper, we analyze the potential of asynchronous relaxation methods on Graphics Processing Un...
Due to non-associativity of floating-point operations and dynamic scheduling on parallel architectur...
International audienceAsynchronous iterations can be used to implement fixed-point methods such as J...
We propose a general algorithm for solving a $n\times n$ nonsingular linear system $Ax = b$ based on...
Abstract. FPGAs and GPUs are increasingly used in a range of high performance computing applications...
In hardware-aware high performance computing, block-asynchronous iteration and mixed precision itera...
We present several algorithms to compute the solution of a linear system of equations on a graphics ...
Abstract—We have previously suggested mixed precision iterative solvers specifically tailored to the...
AbstractThis paper explores the need for asynchronous iteration algorithms as smoothers in multigrid...
Low-precision floating-point arithmetic is a powerful tool for accelerating scientific computing app...
Abstract This paper explores the need for asynchronous iteration algorithms as smoothers in multigri...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
This paper explores the need for asynchronous iteration algorithms as smoothers in multigrid methods...
By using a combination of 32-bit and 64-bit floating point arithmetic, the per-formance of many dens...
In this survey paper, we compare native double precision solvers with emulated- and mixed- precision...
In this paper, we analyze the potential of asynchronous relaxation methods on Graphics Processing Un...
Due to non-associativity of floating-point operations and dynamic scheduling on parallel architectur...
International audienceAsynchronous iterations can be used to implement fixed-point methods such as J...
We propose a general algorithm for solving a $n\times n$ nonsingular linear system $Ax = b$ based on...
Abstract. FPGAs and GPUs are increasingly used in a range of high performance computing applications...