On modern parallel architectures, floating-point computations may become non-deterministic and therefore non-reproducible, mainly due to the non-associativity of floating-point operations. We propose an algorithm to solve dense triangular systems by leveraging the standard parallel triangular solver and our recently introduced multi-level exact summation approach. Finally, we present implementations of the proposed fast reproducible triangular solver and results on recent NVIDIA GPUs.
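The effect described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' GPU implementation or their multi-level exact summation scheme. It assumes that Python's `math.fsum`, which returns a correctly rounded sum on typical IEEE 754 platforms, can stand in for an exact summation. The helper names (`forward_substitution`, `make_shuffled`, `naive_sum`, `random_system`) are hypothetical and introduced only for illustration. Shuffling the terms of each row sum imitates a parallel reduction whose order changes between runs.

```python
import math
import random


def forward_substitution(L, b, accumulate):
    """Solve L x = b for a lower-triangular L by forward substitution.

    `accumulate` sums a list of floats; swapping it changes only how
    each row sum is rounded, not the algorithm itself.
    """
    n = len(b)
    x = [0.0] * n
    for i in range(n):
        # Collect the terms of b[i] - sum_j L[i][j] * x[j] before summing.
        terms = [b[i]] + [-L[i][j] * x[j] for j in range(i)]
        x[i] = accumulate(terms) / L[i][i]
    return x


def naive_sum(terms):
    """Plain left-to-right floating-point accumulation."""
    total = 0.0
    for t in terms:
        total += t
    return total


def make_shuffled(summation, seed):
    """Wrap `summation` so the terms arrive in a seed-dependent order,
    imitating a reduction order that varies from run to run."""
    rng = random.Random(seed)

    def accumulate(terms):
        terms = list(terms)
        rng.shuffle(terms)
        return summation(terms)

    return accumulate


def random_system(n, seed=0):
    """Build a well-scaled random lower-triangular test system (illustrative data)."""
    rng = random.Random(seed)
    L = [[rng.uniform(-1.0, 1.0) if j < i else 0.0 for j in range(n)]
         for i in range(n)]
    for i in range(n):
        L[i][i] = rng.uniform(1.0, 2.0)
    b = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    return L, b


if __name__ == "__main__":
    L, b = random_system(40, seed=1)
    naive = [forward_substitution(L, b, make_shuffled(naive_sum, s)) for s in range(5)]
    exact = [forward_substitution(L, b, make_shuffled(math.fsum, s)) for s in range(5)]
    print("naive runs identical:", all(r == naive[0] for r in naive))
    print("fsum runs identical: ", all(r == exact[0] for r in exact))
```

With the order-independent summation, every run yields bitwise-identical solutions regardless of the shuffle seed, whereas the naive accumulation may differ in the last bits from run to run. Each product `L[i][j] * x[j]` is a single rounded multiplication, so only the order of the additions can introduce variation.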
Tridiagonal diagonally dominant linear systems arise in many scientific and engineering applications...
We consider the problem of computing a scaling α such that the solution x of the scaled linear syste...
An error complexity analysis of two algorithms for solving a unit-diagonal triangular system is give...
On modern parallel architectures, floating-point computations may become non-d...
Floating-point computations are not deterministic on parallel environments. Therefore, ...
On modern multi-core, many-core, and heterogeneous architectures, floating-point co...
On modern multi-core, many-core, and heterogeneous architectures, floating-point computations, espec...
On modern multi-core, many-core, and heterogeneous architectures, floating-poi...
Numerical Reproducibility at Exascale (NRE2015) workshop held as part of the Supercomputing Conferen...
Due to non-associativity of floating-point operations and dynamic schedu...
Due to non-associativity of floating-point operations and dynamic scheduling on parallel architectur...
A few parallel algorithms for solving triangular systems resulting from parallel factorization of sp...