On many current and emerging computing architectures, single-precision calculations are at least twice as fast as double-precision calculations. In addition, the use of single precision may reduce pressure on memory bandwidth. The penalty for using single precision for the solution of linear systems is a potential loss of accuracy in the computed solutions. For sparse linear systems, the use of mixed precision in which double-precision iterative methods are preconditioned by a single-precision factorization can enable the recovery of high-precision solutions more quickly and use less memory than a sparse direct solver run using double-precision arithmetic. In this article, we consider the use of single precision within direct solvers for sp...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
This is the pre-peer reviewed version of the following article: Adaptive precision in block‐Jacobi p...
We propose an adaptive scheme to reduce communication overhead caused by data movement by selectivel...
It is well established that reduced precision arithmetic can be exploited to accelerate the solution...
It is well established that mixed precision algorithms that factorize a matrix at a precision lower...
At the heart of many computations in science and engineering lies the need to efficiently and accura...
The standard LU factorization-based solution process for linear systems can be enhanced in speed or ...
What is the fastest way to solve a linear system $Ax= b$ in arithmetic of a given precision when $A$...
In recent years, a number of new direct solvers for the solution of large sparse, symmetric linear s...
In recent years a number of solvers for the direct solution of large sparse symmetric linear systems...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
The need to solve large sparse linear systems of equations efficiently lies at the heart of many app...
Out-of-core sparse direct solvers reduce the amount of main memory needed to factorize and solve lar...
3rd International Workshop on Energy Efficient Supercomputing (E2SC '15)We formulate an implementati...
L'accessibilité grandissante des arithmétiques à précision faible (tfloat32, fp16, bfloat16, fp8) da...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
This is the pre-peer reviewed version of the following article: Adaptive precision in block‐Jacobi p...
We propose an adaptive scheme to reduce communication overhead caused by data movement by selectivel...
It is well established that reduced precision arithmetic can be exploited to accelerate the solution...
It is well established that mixed precision algorithms that factorize a matrix at a precision lower...
At the heart of many computations in science and engineering lies the need to efficiently and accura...
The standard LU factorization-based solution process for linear systems can be enhanced in speed or ...
What is the fastest way to solve a linear system $Ax= b$ in arithmetic of a given precision when $A$...
In recent years, a number of new direct solvers for the solution of large sparse, symmetric linear s...
In recent years a number of solvers for the direct solution of large sparse symmetric linear systems...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
The need to solve large sparse linear systems of equations efficiently lies at the heart of many app...
Out-of-core sparse direct solvers reduce the amount of main memory needed to factorize and solve lar...
3rd International Workshop on Energy Efficient Supercomputing (E2SC '15)We formulate an implementati...
L'accessibilité grandissante des arithmétiques à précision faible (tfloat32, fp16, bfloat16, fp8) da...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
This is the pre-peer reviewed version of the following article: Adaptive precision in block‐Jacobi p...
We propose an adaptive scheme to reduce communication overhead caused by data movement by selectivel...