In this work, we pursue the idea of radically decoupling the floating point format used for arithmetic operations from the format used to store the data in memory. We complement this idea with a customized precision memory format derived by splitting the mantissa (significand) of standard IEEE formats into segments, such that values can be accessed faster if lower accuracy is acceptable. Combined with precision‐aware algorithms that dynamically adapt the data access accuracy to the numerical requirements, the customized precision memory format can render attractive runtime savings without impacting the memory footprint of the data or the accuracy of the final result. In an experimental analysis using the adaptive precision Jacobi method on ...
By using a combination of 32-bit and 64-bit floating point arithmetic, the per-formance of many dens...
A plethora of program analysis and optimization techniques rely on linear programming at their heart...
On many current and emerging computing architectures, single-precision calculations are at least twi...
3rd International Workshop on Energy Efficient Supercomputing (E2SC '15)We formulate an implementati...
We describe the application of a communication-reduction technique for the PageRank algorithm that d...
With the memory bandwidth of current computer architectures being significantly slower than the (flo...
We propose an adaptive scheme to reduce communication overhead caused by data movement by selectivel...
[EN] We describe the application of a communication-reduction technique for the PageRank algorithm t...
This is the pre-peer reviewed version of the following article: Adaptive precision in block‐Jacobi p...
Abstract On modern architectures, the performance of 32-bit operations is often at least twice as fa...
© ACM, 2021. This is the author's version of the work. It is posted here by permission of ACM for yo...
By using a combination of 32-bit and 64-bit floating point arithmetic the performance of many sparse...
It is well established that reduced precision arithmetic can be exploited to accelerate the solution...
Recent versions of microprocessors exhibit performance characteristics for 32 bit floating point ari...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
By using a combination of 32-bit and 64-bit floating point arithmetic, the per-formance of many dens...
A plethora of program analysis and optimization techniques rely on linear programming at their heart...
On many current and emerging computing architectures, single-precision calculations are at least twi...
3rd International Workshop on Energy Efficient Supercomputing (E2SC '15)We formulate an implementati...
We describe the application of a communication-reduction technique for the PageRank algorithm that d...
With the memory bandwidth of current computer architectures being significantly slower than the (flo...
We propose an adaptive scheme to reduce communication overhead caused by data movement by selectivel...
[EN] We describe the application of a communication-reduction technique for the PageRank algorithm t...
This is the pre-peer reviewed version of the following article: Adaptive precision in block‐Jacobi p...
Abstract On modern architectures, the performance of 32-bit operations is often at least twice as fa...
© ACM, 2021. This is the author's version of the work. It is posted here by permission of ACM for yo...
By using a combination of 32-bit and 64-bit floating point arithmetic the performance of many sparse...
It is well established that reduced precision arithmetic can be exploited to accelerate the solution...
Recent versions of microprocessors exhibit performance characteristics for 32 bit floating point ari...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
By using a combination of 32-bit and 64-bit floating point arithmetic, the per-formance of many dens...
A plethora of program analysis and optimization techniques rely on linear programming at their heart...
On many current and emerging computing architectures, single-precision calculations are at least twi...