Today's floating-point arithmetic landscape is broader than ever. While scientific computing has traditionally used single precision and double precision floating-point arithmetics, half precision is increasingly available in hardware and quadruple precision is supported in software. Lower precision arithmetic brings increased speed and reduced communication and energy costs, but it produces results of correspondingly low accuracy. Higher precisions are more expensive but can potentially provide great benefits, even if used sparingly. A variety of mixed precision algorithms have been developed that combine the superior performance of lower precisions with the better accuracy of higher precisions. Some of these algorithms aim to provide resu...
Abstract On modern architectures, the performance of 32-bit operations is often at least twice as fa...
It is well established that reduced precision arithmetic can be exploited to accelerate the solution...
Abstract—The aim of the paper is to analyze the potential of the mixed precision iterative refinemen...
Today's floating-point arithmetic landscape is broader than ever. While scientific computing has tra...
The largest dense linear systems that are being solved today are of order $n = 10^7$. Single precis...
Motivated by the demand in machine learning, modern computer hardware is increas- ingly supporting r...
Mixed-precision algorithms are a class of algorithms that uses low precision in part of the algorith...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
What is the fastest way to solve a linear system $Ax= b$ in arithmetic of a given precision when $A$...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
We introduce a novel approach to exploit mixed precision arithmetic for low-rank approximations. Our...
It is well established that mixed precision algorithms that factorize a matrix at a precision lower...
By using a combination of 32-bit and 64-bit floating point arithmetic, the per-formance of many dens...
The effects of rounding errors on algorithms in numerical linear algebra have been much-studied for ...
This article describes the design rationale, a C implementation, and conformance testing of a subset...
Abstract On modern architectures, the performance of 32-bit operations is often at least twice as fa...
It is well established that reduced precision arithmetic can be exploited to accelerate the solution...
Abstract—The aim of the paper is to analyze the potential of the mixed precision iterative refinemen...
Today's floating-point arithmetic landscape is broader than ever. While scientific computing has tra...
The largest dense linear systems that are being solved today are of order $n = 10^7$. Single precis...
Motivated by the demand in machine learning, modern computer hardware is increas- ingly supporting r...
Mixed-precision algorithms are a class of algorithms that uses low precision in part of the algorith...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
What is the fastest way to solve a linear system $Ax= b$ in arithmetic of a given precision when $A$...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
We introduce a novel approach to exploit mixed precision arithmetic for low-rank approximations. Our...
It is well established that mixed precision algorithms that factorize a matrix at a precision lower...
By using a combination of 32-bit and 64-bit floating point arithmetic, the per-formance of many dens...
The effects of rounding errors on algorithms in numerical linear algebra have been much-studied for ...
This article describes the design rationale, a C implementation, and conformance testing of a subset...
Abstract On modern architectures, the performance of 32-bit operations is often at least twice as fa...
It is well established that reduced precision arithmetic can be exploited to accelerate the solution...
Abstract—The aim of the paper is to analyze the potential of the mixed precision iterative refinemen...