Writing mixed-precision kernels allows to achieve higher throughput together with outputs whose precision remain within given limits. The recent introduction of native half-precision arithmetic capabilities in several GPUs, such as NVIDIA P100 and AMD Vega 10, contributes to make precision-tuning even more relevant as of late. However, it is not trivial to manually find which variables are to be represented as half-precision instead of single- or double-precision. Although the use of half-precision arithmetic can speed up kernel execution considerably, it can also result in providing non-usable kernel outputs, whenever the wrong variables are declared using the half-precision data-type. In this paper we present an automatic approach for pre...
High performance Computing is increasingly being done on parallel machines like GPUs. In my work, I ...
Mixed-precision is a paradigm that tries to combine computations with different levels of precision ...
In this work, we propose a method that allows us to decrease the energy consumption in supercomputin...
Writing mixed-precision kernels allows to achieve higher throughput together with outputs whose prec...
Writing mixed-precision kernels allows to achieve higher throughput together with outputs whose prec...
ixed precision is an approximate computing technique that can be used to trade-off computation accur...
Precision tuning consists of finding the least floating-point formats enabling a program to compute ...
Mixed precision is an approximate computing technique that can be used to trade-off computation accu...
International audienceError-tolerating applications are increasingly common in the emerging field of...
Traditional optimization methods rely on the use of single-precision floating point arithmetic, whic...
Abstract—This paper introduces a novel mixed precision methodology for mathematical optimisation. It...
Field of study: Electrical engineering.Dr. Michela Becchi, Thesis Supervisor."December 2017."Floatin...
Graphics Processing Units (GPUs) have revolutionized the HPC landscape. The first generation of exas...
Low-precision arithmetic has had a transformative effect on the training of neural networks, reducin...
Graphics Processing Units (GPUs) have revolutionized the HPC landscape. The first generation of exas...
High performance Computing is increasingly being done on parallel machines like GPUs. In my work, I ...
Mixed-precision is a paradigm that tries to combine computations with different levels of precision ...
In this work, we propose a method that allows us to decrease the energy consumption in supercomputin...
Writing mixed-precision kernels allows to achieve higher throughput together with outputs whose prec...
Writing mixed-precision kernels allows to achieve higher throughput together with outputs whose prec...
ixed precision is an approximate computing technique that can be used to trade-off computation accur...
Precision tuning consists of finding the least floating-point formats enabling a program to compute ...
Mixed precision is an approximate computing technique that can be used to trade-off computation accu...
International audienceError-tolerating applications are increasingly common in the emerging field of...
Traditional optimization methods rely on the use of single-precision floating point arithmetic, whic...
Abstract—This paper introduces a novel mixed precision methodology for mathematical optimisation. It...
Field of study: Electrical engineering.Dr. Michela Becchi, Thesis Supervisor."December 2017."Floatin...
Graphics Processing Units (GPUs) have revolutionized the HPC landscape. The first generation of exas...
Low-precision arithmetic has had a transformative effect on the training of neural networks, reducin...
Graphics Processing Units (GPUs) have revolutionized the HPC landscape. The first generation of exas...
High performance Computing is increasingly being done on parallel machines like GPUs. In my work, I ...
Mixed-precision is a paradigm that tries to combine computations with different levels of precision ...
In this work, we propose a method that allows us to decrease the energy consumption in supercomputin...