Mixed-precision algorithms are a class of algorithms that uses low precision in part of the algorithm in order to save time and energy with less accurate computation and communication. These algorithms usually utilize iterative refinement processes to improve the approximate solution obtained from low precision to the accuracy we desire from doing all the computation in high precision. Due to the demand of deep learning applications, there are hardware developments offering different low-precision formats including half precision (FP16), Bfloat16 and integer operations for quantized integers, which uses integers with a shared scalar to represent a set of equally spaced numbers. As new hardware architectures focus on bringing performance in ...
Hardware trends have motivated the development of mixed precision algo-rithms in numerical linear al...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
We propose a general algorithm for solving a $n\times n$ nonsingular linear system $Ax = b$ based on...
Today's floating-point arithmetic landscape is broader than ever. While scientific computing has tra...
AbstractWe investigate novel iterative refinement methods for solving eigenvalue problems which are ...
Motivated by the demand in machine learning, modern computer hardware is increas- ingly supporting r...
The eigenvalue problem is a fundamental problem in scientific computing. In this paper, we propose a...
International audienceThe standard LU factorization-based solution process for linear systems can be...
summary:With the emergence of mixed precision hardware, mixed precision GMRES-based iterative refine...
The increasing availability of very low precisions (tfloat32, fp16, bfloat16, fp8) in hardware pushe...
What is the fastest way to solve a linear system $Ax= b$ in arithmetic of a given precision when $A$...
A new hybrid algorithm for LDU-factorization for large sparse matrix combining iterative solver, whi...
L'accessibilité grandissante des arithmétiques à précision faible (tfloat32, fp16, bfloat16, fp8) da...
It is well established that mixed precision algorithms that factorize a matrix at a precision lower...
The real symmetric tridiagonal eigenproblem is of outstanding importance in numerical computations; ...
Hardware trends have motivated the development of mixed precision algo-rithms in numerical linear al...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
We propose a general algorithm for solving a $n\times n$ nonsingular linear system $Ax = b$ based on...
Today's floating-point arithmetic landscape is broader than ever. While scientific computing has tra...
AbstractWe investigate novel iterative refinement methods for solving eigenvalue problems which are ...
Motivated by the demand in machine learning, modern computer hardware is increas- ingly supporting r...
The eigenvalue problem is a fundamental problem in scientific computing. In this paper, we propose a...
International audienceThe standard LU factorization-based solution process for linear systems can be...
summary:With the emergence of mixed precision hardware, mixed precision GMRES-based iterative refine...
The increasing availability of very low precisions (tfloat32, fp16, bfloat16, fp8) in hardware pushe...
What is the fastest way to solve a linear system $Ax= b$ in arithmetic of a given precision when $A$...
A new hybrid algorithm for LDU-factorization for large sparse matrix combining iterative solver, whi...
L'accessibilité grandissante des arithmétiques à précision faible (tfloat32, fp16, bfloat16, fp8) da...
It is well established that mixed precision algorithms that factorize a matrix at a precision lower...
The real symmetric tridiagonal eigenproblem is of outstanding importance in numerical computations; ...
Hardware trends have motivated the development of mixed precision algo-rithms in numerical linear al...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
We propose a general algorithm for solving a $n\times n$ nonsingular linear system $Ax = b$ based on...