© ACM, 2021. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in ACM Transactions on Mathematical Software, Volume 47, Issue , June 2021, http://doi.acm.org/10.1145/3441850[EN] The use of mixed precision in numerical algorithms is a promising strategy for accelerating scientific applications. In particular, the adoption of specialized hardware and data formats for low-precision arithmetic in high-end GPUs (graphics processing units) has motivated numerous efforts aiming at carefully reducing the working precision in order to speed up the computations. For algorithms whose performance is bound by the memory bandwidth, the idea ...
Low-precision arithmetic has had a transformative effect on the training of neural networks, reducin...
The Preconditioned Conjugate Gradient method is often used in numerical simulations. While being wid...
Many scientific applications require the solution of large and sparse linear systems of equations us...
This is the pre-peer reviewed version of the following article: Adaptive precision in block‐Jacobi p...
We propose an adaptive scheme to reduce communication overhead caused by data movement by selectivel...
Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many ...
The roofline model not only provides a powerful tool to relate an application\u27s performance with ...
In this work, we pursue the idea of radically decoupling the floating point format used for arithmet...
© ACM, YYYY. This is the author's version of the work "Anzt, H., Cojean, T., Flegar, G., Göbel, F., ...
[EN] With the memory bandwidth of current computer architectures being significantly slower than the...
The solution of linear systems of equations is a central task in a number of scientific and engineer...
Hardware trends have motivated the development of mixed precision algo-rithms in numerical linear al...
International audienceThis paper introduces an adaptive preconditioner for iterative solution of spa...
With the memory bandwidth of current computer architectures being significantly slower than the (flo...
With the breakdown of Dennard scaling in the mid-2000s and the end of Moore's law on the horizon, th...
Low-precision arithmetic has had a transformative effect on the training of neural networks, reducin...
The Preconditioned Conjugate Gradient method is often used in numerical simulations. While being wid...
Many scientific applications require the solution of large and sparse linear systems of equations us...
This is the pre-peer reviewed version of the following article: Adaptive precision in block‐Jacobi p...
We propose an adaptive scheme to reduce communication overhead caused by data movement by selectivel...
Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many ...
The roofline model not only provides a powerful tool to relate an application\u27s performance with ...
In this work, we pursue the idea of radically decoupling the floating point format used for arithmet...
© ACM, YYYY. This is the author's version of the work "Anzt, H., Cojean, T., Flegar, G., Göbel, F., ...
[EN] With the memory bandwidth of current computer architectures being significantly slower than the...
The solution of linear systems of equations is a central task in a number of scientific and engineer...
Hardware trends have motivated the development of mixed precision algo-rithms in numerical linear al...
International audienceThis paper introduces an adaptive preconditioner for iterative solution of spa...
With the memory bandwidth of current computer architectures being significantly slower than the (flo...
With the breakdown of Dennard scaling in the mid-2000s and the end of Moore's law on the horizon, th...
Low-precision arithmetic has had a transformative effect on the training of neural networks, reducin...
The Preconditioned Conjugate Gradient method is often used in numerical simulations. While being wid...
Many scientific applications require the solution of large and sparse linear systems of equations us...