Owing to developments in semiconductor technology, fault tolerance is important not only for safety-critical systems but also for general-purpose (non-safety-critical) systems. For general-purpose systems, however, the goal is not to guarantee that deadlines are always met but to minimize the average execution time (AET) while ensuring fault tolerance. For a given job and a given soft (transient) error probability, we define mathematical formulas for the AET that include bus communication overhead for both voting (active replication) and rollback-recovery with checkpointing (RRC). In addition, for a given multi-processor system-on-chip (MPSoC), we define integer linear programming (ILP) models that minimize the AET, including bus communication overhead, when...
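To make the AET trade-off for rollback-recovery with checkpointing concrete, here is a minimal sketch under a simplified geometric-retry model. It is not the formula defined in the paper: the function names, the single `overhead` parameter (which lumps checkpointing and bus communication into one per-segment cost), and the assumption that errors hit equal-length segments independently are all illustrative assumptions.

```python
def aet_rrc(T, p_err, n, overhead):
    """Average execution time under a simplified checkpointing model.

    Assumptions (illustrative, not taken from the paper):
      T        -- fault-free execution time of the job
      p_err    -- probability that a run of length T suffers a soft error
      n        -- number of equal-length execution segments (checkpoints)
      overhead -- per-segment cost, lumping checkpoint and bus communication
    A segment that sees an error is re-executed until it succeeds.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    if not 0.0 <= p_err < 1.0:
        raise ValueError("p_err must be in [0, 1)")
    # Probability that one segment of length T/n completes without error.
    seg_success = (1.0 - p_err) ** (1.0 / n)
    # Time for one attempt at a segment, including its overhead.
    seg_time = T / n + overhead
    # Expected attempts per segment are 1 / seg_success (geometric distribution).
    return n * seg_time / seg_success


def best_checkpoint_count(T, p_err, overhead, n_max=1000):
    """Exhaustively search 1..n_max for the segment count with the lowest AET."""
    return min(range(1, n_max + 1), key=lambda n: aet_rrc(T, p_err, n, overhead))
```

In this model, more checkpoints shorten the work lost per error but add overhead per segment, so the best count depends on the error probability. With `p_err = 0` the search returns a single segment, since extra checkpoints only add cost.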
As machines increase in scale, it is predicted that failure rates of supercomputers will correspondi...
Researchers have mentioned that the three most difficult and growing problems in the future of high-...
Over the last decade, computing systems have turned to large-scale parallel platforms composed of thousand...
Due to increased complexity of today’s computer systems, which are manufactured in recent semiconduc...
Increasing soft error rates in recent semiconductor technologies enforce the usage of fault toleranc...
For the vast majority of computer systems correct operation is defined as producing the correct resu...
Parallel execution time is expected to decrease as the number of processors in...
We present an approach to the synthesis of fault-tolerant hard real-time systems for safet...
This report provides an introduction to the design of scheduling algorithms to cope with faults on l...
The large scale of current and next-generation massively parallel processing (MPP) systems presents ...
To meet an insatiable consumer demand for greater performance at less power, silicon technology has ...
Correct operation of real-time systems (RTS) is defined as producing correct results within given ti...