International audienceIn this talk we will discuss possible numerical remedies to survive data loss in some numerical linear algebra solvers namely Krylov subspace linear solvers and some widely used eigensolvers. Assuming that a separate mechanism ensures fault detection, we propose numerical algorithms to extract relevant information from available data after a fault. After data extraction, well chosen part of missing data is regenerated through interpolation strategies to constitute meaningful inputs to numerically restart. We will also present some preliminary investigations to address soft error detection again at the application level in the conjugate gradient framework
AbstractIn the multi-peta-flop era for supercomputers, the number of computing cores is growing expo...
none4siBlock Krylov subspace methods (KSMs) comprise building blocks in many state-of-the-art solver...
We present a fault model designed to bring out the “worst” in iterative solvers based on mathematica...
International audienceIn this talk we will discuss possible numerical remedies to survive data loss...
International audienceThe advent of extreme scale machines will require the use of parallel resource...
International audienceThe advent of extreme scale machines will require the use of parallel resource...
International audience: The advent of extreme scale machines will require the use of parallel resour...
International audiencehe advent of extreme scale machines will require the use of parallel resources...
International audienceAs the computational power of high performance computing (HPC) systems continu...
This paper presents a method to protect iterative solvers from Detected and Uncorrected Errors (DUE)...
International audienceThe solution of large eigenproblems is involved in many scientific and enginee...
ELLIOTT III, JAMES JOHN. Resilient Iterative Linear Solvers Running Through Errors. (Under the direc...
As we stride toward the exascale era, due to increasing complexity of supercomputers, hard and soft ...
The convergence of Krylov subspace eigenvalue algorithms can be robustly measured by the angle the a...
Parallel implementations of Krylov subspace methods often help to accelerate the procedure of findin...
AbstractIn the multi-peta-flop era for supercomputers, the number of computing cores is growing expo...
none4siBlock Krylov subspace methods (KSMs) comprise building blocks in many state-of-the-art solver...
We present a fault model designed to bring out the “worst” in iterative solvers based on mathematica...
International audienceIn this talk we will discuss possible numerical remedies to survive data loss...
International audienceThe advent of extreme scale machines will require the use of parallel resource...
International audienceThe advent of extreme scale machines will require the use of parallel resource...
International audience: The advent of extreme scale machines will require the use of parallel resour...
International audiencehe advent of extreme scale machines will require the use of parallel resources...
International audienceAs the computational power of high performance computing (HPC) systems continu...
This paper presents a method to protect iterative solvers from Detected and Uncorrected Errors (DUE)...
International audienceThe solution of large eigenproblems is involved in many scientific and enginee...
ELLIOTT III, JAMES JOHN. Resilient Iterative Linear Solvers Running Through Errors. (Under the direc...
As we stride toward the exascale era, due to increasing complexity of supercomputers, hard and soft ...
The convergence of Krylov subspace eigenvalue algorithms can be robustly measured by the angle the a...
Parallel implementations of Krylov subspace methods often help to accelerate the procedure of findin...
AbstractIn the multi-peta-flop era for supercomputers, the number of computing cores is growing expo...
none4siBlock Krylov subspace methods (KSMs) comprise building blocks in many state-of-the-art solver...
We present a fault model designed to bring out the “worst” in iterative solvers based on mathematica...