Many current approaches to software-implemented fault tolerance (SIFT) rely on process replication, which is often prohibitively expensive for practical use due to its high performance overhead and cost. The Adaptive Reconfigurable Mobile Objects of Reliability (Armor) middleware architecture offers a scalable, low-overhead way to provide high-dependability services to applications. It uses coordinated multithreaded processes to manage redundant resources across interconnected nodes, detect errors in user applications and infrastructural components, and provide failure recovery. The authors describe their experiences and lessons learned in deploying Armor in several diverse fields. The widespread availability of relatively low-cost, high-pe...
The functionality and the performance of smart environment applications can be hampered by faults. F...
Evolution of systems during their operational life is mandatory and both updat...
227 p. Thesis (Ph.D.), University of Illinois at Urbana-Champaign, 2003. This thesis introduces the th...
Clusters of message-passing computing nodes provide high-performance platforms for distributed appli...
Exascale systems of the future are predicted to have mean time between failures (MTBF) of less than ...
Fault-tolerant programs are typically not only difficult to implement but also incur extra costs i...
Today’s software engineering and application development trend is to take advantage of reusable soft...
Abstract. Fault-tolerant programs are typically not only difficult to implement but also incur extra...
To meet an insatiable consumer demand for greater performance at less power, silicon technology has ...