In this survey paper, we compare native double precision solvers with emulated- and mixed-precision solvers for the linear systems of equations that typically arise in finite element discretisations. The emulation uses two single precision numbers to achieve higher precision, while mixed precision iterative refinement computes residuals and updates the solution vector in double precision but solves the residual systems in single precision. Both techniques have been known since the 1960s, but little attention has been devoted to their performance. Motivated by changing paradigms in processor technology and the emergence of highly parallel devices with outstanding single precision performance, we adapt the emulation and mixed precision...
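The two techniques named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the helper names (`f32`, `two_sum`, `ds_add`, `solve_f32`, `refine`), the dense Gaussian elimination used as the single precision inner solver, and the small tridiagonal test matrix are all assumptions chosen for illustration. Single precision is simulated by rounding every intermediate result to 32 bits with the standard `struct` module.

```python
import struct

def f32(x):
    """Round a Python float (double) to the nearest single precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]

def two_sum(a, b):
    """Knuth's error-free addition in single precision: a + b == s + e."""
    s = f32(a + b)
    bb = f32(s - a)
    e = f32(f32(a - f32(s - bb)) + f32(b - bb))
    return s, e

def ds_add(x, y):
    """Add two emulated values, each stored as a (high, low) pair of singles.
    This is the simplified variant, not the fully accurate one."""
    s, e = two_sum(x[0], y[0])
    e = f32(e + f32(x[1] + y[1]))
    hi = f32(s + e)
    lo = f32(e - f32(hi - s))  # renormalise so that hi carries the leading bits
    return hi, lo

def solve_f32(A, b):
    """Gaussian elimination with partial pivoting, every operation rounded to single."""
    n = len(b)
    M = [[f32(v) for v in row] + [f32(rhs)] for row, rhs in zip(A, b)]
    for k in range(n):
        p = max(range(k, n), key=lambda i: abs(M[i][k]))
        M[k], M[p] = M[p], M[k]
        for i in range(k + 1, n):
            m = f32(M[i][k] / M[k][k])
            for j in range(k, n + 1):
                M[i][j] = f32(M[i][j] - f32(m * M[k][j]))
    x = [0.0] * n
    for i in reversed(range(n)):
        s = M[i][n]
        for j in range(i + 1, n):
            s = f32(s - f32(M[i][j] * x[j]))
        x[i] = f32(s / M[i][i])
    return x

def residual_norm(A, x, b):
    """Maximum norm of b - A x, evaluated in double precision."""
    n = len(b)
    return max(abs(b[i] - sum(A[i][j] * x[j] for j in range(n))) for i in range(n))

def refine(A, b, tol=1e-12, max_iter=20):
    """Mixed precision iterative refinement: residual and update in double,
    correction solve in (simulated) single precision."""
    n = len(b)
    x = [0.0] * n
    for _ in range(max_iter):
        r = [b[i] - sum(A[i][j] * x[j] for j in range(n)) for i in range(n)]
        if max(abs(v) for v in r) <= tol * max(abs(v) for v in b):
            break
        c = solve_f32(A, r)                   # low precision correction
        x = [x[i] + c[i] for i in range(n)]   # high precision update
    return x

# Hypothetical test problem: 1D Poisson stencil, as in simple finite element models.
A = [[2.0, -1.0, 0.0, 0.0],
     [-1.0, 2.0, -1.0, 0.0],
     [0.0, -1.0, 2.0, -1.0],
     [0.0, 0.0, -1.0, 2.0]]
b = [0.1, 0.2, 0.3, 0.4]

print("single only:", residual_norm(A, solve_f32(A, b), b))
print("refined:    ", residual_norm(A, refine(A, b), b))
print("emulated add:", ds_add((1.0, 0.0), (2.0 ** -30, 0.0)))
```

In this sketch the refined solution reaches a residual close to double precision even though every linear solve runs in single precision, and the pair returned by `ds_add` keeps the small addend that a plain single precision addition would discard. Production solvers would replace the dense elimination with an iterative method such as multigrid or a Krylov scheme, which is a design choice outside the scope of this illustration.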
Programmable hardware, in particular Field Programmable Gate Arrays (FPGAs), promises a significant ...
The advent of general purpose graphics processing units (GPGPUs) brings about a whole new platfo...
FPGA devices used in the HPC context promise an increased energy efficiency, enhancing the computing...
The precision used in an algorithm affects the error and performance of individual computations, the...
FPGAs are becoming more and more attractive for high precision scientific computations. One of the m...
We have previously suggested mixed precision iterative solvers specifically tailored to the...
The performance of algebraic multigrid (AMG) algorithms, implemented in 4-byte floating poin...
This paper introduces a novel mixed precision methodology for mathematical optimisation. It...
The continued development of improved algorithms and architecture for numerical simulations is at th...
Current CPU and GPU architectures heavily use data and instruction parallelism at different...
FPGAs and GPUs are increasingly used in a range of high performance computing applications...
Processor technology is still dramatically advancing and promises enormous improvements in processin...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
By using a combination of 32-bit and 64-bit floating point arithmetic, the per...