Ultra-large-scale simulations that solve partial differential equations (PDEs) require very large computational systems for their timely solution. Studies have shown that the failure rate grows with system size, and this trend is likely to worsen in future machines. Thus, as systems, and the problems solved on them, continue to grow, the ability to survive failures is becoming a critical aspect of algorithm development. The sparse grid combination technique (SGCT), a cost-effective method for solving higher-dimensional PDEs, can be easily modified to provide algorithm-based fault tolerance. In this article, we describe how the SGCT can produce fault-tolerant versions of the Gyrokinetic Electromagnetic Numerical Experiment plasma app...
This paper discusses on-going work with the Integrated Plasma Simulator (IPS), a framework f...
In this paper we will discuss some approaches to fault-tolerance for solving partial differential eq...
This work is based on the seminar titled ‘Resiliency in Numerical Algorithm Design for Extreme Scale...
The data volume of Partial Differential Equation (PDE) based ultra-large-scale scientific simul...
This paper continues to develop a fault tolerant extension of the sparse grid combination technique ...
This paper continues to develop a fault tolerant extension of the sparse grid combination technique ...
A key issue confronting petascale and exascale computing is the growth in probability of sof...
Many large scale scientific simulations involve the time evolution of systems modelled as Partial Di...
A key issue confronting petascale and exascale computing is the growth in probability of soft and ha...
Many petascale and exascale scientific simulations involve the time evolution of systems modelled as...
Plasma fusion is one of the promising candidates for an emission-free energy source and is heavily i...
Many petascale and exascale scientific simulations involve the time evolution of systems modelled as...
One of the challenges for efficiently and effectively using petascale and exascale computers is the ...
One of the challenges for efficiently and effectively using petascale and exascale computers is the ...
A key issue confronting petascale and exascale computing is the growth in probability of soft and ha...