AbstractWe present a general technique to solve Partial Differential Equations, called robust stencils, which make them tolerant to soft faults, i.e. bit flips arising in memory or CPU calculations. We show how it can be applied to a two-dimensional Lax-Wendroff solver. The resulting 2D robust stencils are derived using an orthogonal application of their 1D counterparts. Combinations of 3 to 5 base stencils can then be created. We describe how these are then implemented in a parallel advection solver. Various robust stencil combinations are explored, representing tradeoff between performance and robustness. The results indicate that the 3-stencil robust combinations are slightly faster on large parallel workloads than Triple Modular Redunda...
This paper continues to develop a fault tolerant extension of the sparse grid combination technique ...
ELLIOTT III, JAMES JOHN. Resilient Iterative Linear Solvers Running Through Errors. (Under the direc...
We investigate the design of dynamic programming algorithms in unreliable memories, i.e., in the pre...
We present a general technique to solve Partial Differential Equations, called robust stencils, whic...
AbstractA key issue confronting petascale and exascale computing is the growth in probability of sof...
Resilient algorithms in high-performance computing are subject to rigorous non-functional constrain...
In previous works, approaches for fault tolerant computation of PDEs were described which utilise fl...
A key issue confronting petascale and exascale computing is the growth in probability of soft and ha...
Energy increasingly constrains modern computer hardware, yet protecting computations and data agains...
AbstractIn the multi-peta-flop era for supercomputers, the number of computing cores is growing expo...
Some of the present day applications run on computer platforms with large and inexpensive memories, ...
Soft errors are increasing in modern computer systems. These faults can corrupt the results of nume...
One of the challenges for efficiently and effectively using petascale and exascale computers is the ...
We present a fault model designed to bring out the “worst” in iterative solvers based on mathematica...
A key issue confronting petascale and exascale computing is the growth in probability of soft and ha...
This paper continues to develop a fault tolerant extension of the sparse grid combination technique ...
ELLIOTT III, JAMES JOHN. Resilient Iterative Linear Solvers Running Through Errors. (Under the direc...
We investigate the design of dynamic programming algorithms in unreliable memories, i.e., in the pre...
We present a general technique to solve Partial Differential Equations, called robust stencils, whic...
AbstractA key issue confronting petascale and exascale computing is the growth in probability of sof...
Resilient algorithms in high-performance computing are subject to rigorous non-functional constrain...
In previous works, approaches for fault tolerant computation of PDEs were described which utilise fl...
A key issue confronting petascale and exascale computing is the growth in probability of soft and ha...
Energy increasingly constrains modern computer hardware, yet protecting computations and data agains...
AbstractIn the multi-peta-flop era for supercomputers, the number of computing cores is growing expo...
Some of the present day applications run on computer platforms with large and inexpensive memories, ...
Soft errors are increasing in modern computer systems. These faults can corrupt the results of nume...
One of the challenges for efficiently and effectively using petascale and exascale computers is the ...
We present a fault model designed to bring out the “worst” in iterative solvers based on mathematica...
A key issue confronting petascale and exascale computing is the growth in probability of soft and ha...
This paper continues to develop a fault tolerant extension of the sparse grid combination technique ...
ELLIOTT III, JAMES JOHN. Resilient Iterative Linear Solvers Running Through Errors. (Under the direc...
We investigate the design of dynamic programming algorithms in unreliable memories, i.e., in the pre...