2013-03-05Hostile environments, shrinking feature sizes and processor aging elicit a need for resilient computing. Traditional course-grained approaches, such as software Checkpoint and Restart (C/R) and hardware Triple Modular Redundancy (TMR), while exhibiting acceptable levels of fault coverage, are often wasteful of resources such as time, device/chip area or power. In order to mitigate these shortcomings, Resiliency-aware Scheduling (RaS), a source-level approach is introduced and described. Resiliency-aware Scheduling combines traditional compiler techniques such as critical path and dependency analysis with the ability to potentially modify the target architecture’s resource configuration. This new approach can, in many cases, offer ...
Energy efficient semiconductor chips are in high demand to cater the needs of today’s smart products...
Dependable real-time embedded systems are typically composed of a number of heterogeneous computing ...
Resilient design techniques are used to (i) ensure correct operation under dynamic variations and to...
Maintaining performance in a faulty distributed computing environment is a major challenge in the de...
Field-programmable gate arrays (FPGAs) offer the flexibility of general-purpose processors along wit...
With memories continuing to dominate the area, power, cost and performance of a design, there is a c...
We introduce a new design metric called system-resiliency which characterizes the maximum unpredicta...
Reliability is a serious concern for future extreme-scale high-performance computing (HPC) systems. ...
Large-scale systems face two main challenges: failure management and energy management. Failure mana...
International audienceThis paper focuses on the resilient scheduling of parallel jobs on high-perfor...
Contemporary reconfigurable hardware devices have the capability to achieve high performance, power ...
With memories continuing to dominate the area, power, cost and performance of a design, there is a c...
Resilient design techniques are used to (i) ensure correct operation under dynamic variations and to...
Over the past several decades, we have experienced tremendous growth of real-time systems in both sc...
In cyber-physical systems, where embedded computation interacts with physical processes, correctness...
Energy efficient semiconductor chips are in high demand to cater the needs of today’s smart products...
Dependable real-time embedded systems are typically composed of a number of heterogeneous computing ...
Resilient design techniques are used to (i) ensure correct operation under dynamic variations and to...
Maintaining performance in a faulty distributed computing environment is a major challenge in the de...
Field-programmable gate arrays (FPGAs) offer the flexibility of general-purpose processors along wit...
With memories continuing to dominate the area, power, cost and performance of a design, there is a c...
We introduce a new design metric called system-resiliency which characterizes the maximum unpredicta...
Reliability is a serious concern for future extreme-scale high-performance computing (HPC) systems. ...
Large-scale systems face two main challenges: failure management and energy management. Failure mana...
International audienceThis paper focuses on the resilient scheduling of parallel jobs on high-perfor...
Contemporary reconfigurable hardware devices have the capability to achieve high performance, power ...
With memories continuing to dominate the area, power, cost and performance of a design, there is a c...
Resilient design techniques are used to (i) ensure correct operation under dynamic variations and to...
Over the past several decades, we have experienced tremendous growth of real-time systems in both sc...
In cyber-physical systems, where embedded computation interacts with physical processes, correctness...
Energy efficient semiconductor chips are in high demand to cater the needs of today’s smart products...
Dependable real-time embedded systems are typically composed of a number of heterogeneous computing ...
Resilient design techniques are used to (i) ensure correct operation under dynamic variations and to...