International audienceThe solution of large eigenproblems is involved in many scientific and engineering applications when, for instance stability analysis is a concern. For large simulation in material physics or thermo-acoustics, the calculation can last for many hours on large parallel platforms. However, on future large-scale systems, the mean time between failures (MTBF) of the system is expected to decrease so that many faults could occur during the solution of large eigenproblems. Consequently, it becomes critical to design parallel eigensolvers that can survive faults. In that framework, we investigate the relevance of approaches relying on numerical techniques, which might be combined with more classical techniques for real large s...
This paper continues to develop a fault tolerant extension of the sparse grid combination technique ...
With the proliferation of parallel and distributed systems, it is an increasingly important problem ...
Energy increasingly constrains modern computer hardware, yet protecting computations and data agains...
International audienceThe solution of large eigenproblems is involved in many scientific and enginee...
International audienceThe solution of large eigenproblems is involved in many scientific and enginee...
International audienceThe advent of extreme scale machines will require the use of parallel resource...
International audiencehe advent of extreme scale machines will require the use of parallel resources...
International audienceIn this talk we will discuss possible numerical remedies to survive data loss...
International audienceAs the computational power of high performance computing (HPC) systems continu...
Several recovery techniques for parallel iterative methods are presented. First, the implementation ...
International audienceIn this talk we will discuss possible numerical remedies to survive data loss ...
International audience: The advent of extreme scale machines will require the use of parallel resour...
Iterative solvers for eigenvalue problems are often the only means of computing the extremal eigenva...
International audienceThe advent of extreme scale machines will require the use of parallel resource...
This paper continues to develop a fault tolerant extension of the sparse grid combination technique ...
This paper continues to develop a fault tolerant extension of the sparse grid combination technique ...
With the proliferation of parallel and distributed systems, it is an increasingly important problem ...
Energy increasingly constrains modern computer hardware, yet protecting computations and data agains...
International audienceThe solution of large eigenproblems is involved in many scientific and enginee...
International audienceThe solution of large eigenproblems is involved in many scientific and enginee...
International audienceThe advent of extreme scale machines will require the use of parallel resource...
International audiencehe advent of extreme scale machines will require the use of parallel resources...
International audienceIn this talk we will discuss possible numerical remedies to survive data loss...
International audienceAs the computational power of high performance computing (HPC) systems continu...
Several recovery techniques for parallel iterative methods are presented. First, the implementation ...
International audienceIn this talk we will discuss possible numerical remedies to survive data loss ...
International audience: The advent of extreme scale machines will require the use of parallel resour...
Iterative solvers for eigenvalue problems are often the only means of computing the extremal eigenva...
International audienceThe advent of extreme scale machines will require the use of parallel resource...
This paper continues to develop a fault tolerant extension of the sparse grid combination technique ...
This paper continues to develop a fault tolerant extension of the sparse grid combination technique ...
With the proliferation of parallel and distributed systems, it is an increasingly important problem ...
Energy increasingly constrains modern computer hardware, yet protecting computations and data agains...