Floating-point sums and dot products accumulate rounding errors that may render the result very inaccurate. To address this, Kulisch proposed to use an internal accumulator large enough to cover the full exponent range of floating-point. With it, sums and dot products become exact operations. This idea failed to materialize in general purpose processors, as it was considered to slow and/or too expensive in terms of resources. It may however be an interesting option in recon-figurable computing, where a designer may use use smaller, more resource-efficient floating-point formats, knowing that sums and dot products will be exact. Another motivation of this work is that these exact operations, contrary to classical floating point ones, are ass...
International audienceLet $u$ denote the relative rounding error of some floating-point format. Rece...
It has been shown that FPGAs could outperform high-end microprocessors on floating-point computation...
International audienceThe high performance and capacity of current FPGAs makes them suitable as acce...
Floating-point sums and dot products accumulate rounding errors that may render the result very inac...
International audienceFloating-point operators on FPGAs do not have to be identical to the ones avai...
International audienceFloating-point (FP) addition is non-associative and parallel reduction involvi...
National audienceOn modern multi-core, many-core, and heterogeneous architectures, floating-point co...
High-level synthesis (HLS) tools offer increased productivity regarding FPGA programming. However, d...
International audienceFPGAs are well known for their ability to perform non-standard computations no...
International audienceDot products (also called sums of products) are ubiquitous in matrix computati...
International audienceHigh-level synthesis (HLS) is a big step forward in terms of design productivi...
The set-wise summation operation is perhaps one of the most fundamental and widely used operations i...
International audienceWe present a fast algorithm together with its low-level implementation of corr...
This is an extended version of our ARITH-19 article.This paper presents a study of some basic blocks...
10 pagesInternational audienceThis paper presents a study of some basic blocks needed in the design ...
International audienceLet $u$ denote the relative rounding error of some floating-point format. Rece...
It has been shown that FPGAs could outperform high-end microprocessors on floating-point computation...
International audienceThe high performance and capacity of current FPGAs makes them suitable as acce...
Floating-point sums and dot products accumulate rounding errors that may render the result very inac...
International audienceFloating-point operators on FPGAs do not have to be identical to the ones avai...
International audienceFloating-point (FP) addition is non-associative and parallel reduction involvi...
National audienceOn modern multi-core, many-core, and heterogeneous architectures, floating-point co...
High-level synthesis (HLS) tools offer increased productivity regarding FPGA programming. However, d...
International audienceFPGAs are well known for their ability to perform non-standard computations no...
International audienceDot products (also called sums of products) are ubiquitous in matrix computati...
International audienceHigh-level synthesis (HLS) is a big step forward in terms of design productivi...
The set-wise summation operation is perhaps one of the most fundamental and widely used operations i...
International audienceWe present a fast algorithm together with its low-level implementation of corr...
This is an extended version of our ARITH-19 article.This paper presents a study of some basic blocks...
10 pagesInternational audienceThis paper presents a study of some basic blocks needed in the design ...
International audienceLet $u$ denote the relative rounding error of some floating-point format. Rece...
It has been shown that FPGAs could outperform high-end microprocessors on floating-point computation...
International audienceThe high performance and capacity of current FPGAs makes them suitable as acce...