This paper presents a dynamic scheduling solution to achieve fault tolerance in many-core architectures. Triple Modular Redundancy is applied to the multi-threaded application to dynamically mitigate the effects of both permanent and transient faults, and to identify and isolate damaged units. The approach targets the best performance while balancing the use of the healthy resources to limit wear-out and aging effects, which cause permanent damage. Experimental results on synthetic case studies are reported to validate the ability to tolerate faults while optimizing performance and resource usage.
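The combination described in the abstract (three redundant executions, majority voting, isolation of units that keep disagreeing, and balanced use of healthy units) can be illustrated with a minimal sketch. This is not the paper's scheduler: the function names, the `strike_limit` threshold, the use counter standing in for wear-out, and the model of a core as a Python callable are all assumptions made for illustration.

```python
from collections import Counter


def pick_replica_cores(usage, healthy, replicas=3):
    """Pick the least-used healthy cores (hypothetical wear-balancing policy)."""
    candidates = sorted(healthy, key=lambda core: (usage[core], core))
    if len(candidates) < replicas:
        raise RuntimeError("not enough healthy cores for redundancy")
    return candidates[:replicas]


def run_with_tmr(task, arg, cores, usage, healthy, strikes, strike_limit=2):
    """Run `task` on three cores, vote on the result, and track bad cores.

    `cores` maps a core id to a callable that simulates executing the task
    on that core. `strike_limit` is an assumed threshold: a single
    disagreement is treated as a possible transient fault, repeated
    disagreements as a permanent one.
    """
    chosen = pick_replica_cores(usage, healthy)
    results = {}
    for core in chosen:
        results[core] = cores[core](task, arg)
        usage[core] += 1  # crude proxy for accumulated wear

    # Majority vote over the three replica outputs.
    winner, votes = Counter(results.values()).most_common(1)[0]
    if votes < 2:
        raise RuntimeError("no majority among replicas")

    # Count a strike for each disagreeing core; isolate repeat offenders.
    for core, value in results.items():
        if value != winner:
            strikes[core] += 1
            if strikes[core] >= strike_limit:
                healthy.discard(core)
    return winner
```

With four simulated cores, one of which always returns a wrong value, repeated calls keep returning the voted result, and the faulty core is dropped from `healthy` once it reaches `strike_limit` disagreements. Sorting by use count is only one possible balancing policy; a real scheduler would also weigh performance, which this sketch ignores.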
Safety or mission critical applications have to recover from an error within an acceptable time wind...
Transient faults are emerging as a critical concern in the reliability of general-purpose microproce...
Development trends for computing platforms moved from increasing the frequency...
This paper presents a novel approach to the design of multi-/many-core systems with an adaptive leve...
Future multicore processors will become more susceptible to a variety of hardware failures. In parti...
Fault-tolerance is a crucial aspect of safety critical systems. When such systems need to ...
Conference of IEEE International Symposium on Circuits and Systems, ISCAS 2015; Conference Date: 24...
In this paper we discuss various fault-tolerant task scheduling algorithms for ...
Static redundancy allocation is inappropriate in hard real-time systems that operate in variable and...
The vulnerability of multi-core processors is increasing due to tighter design margins and...
Part 2: Asian Conference on Availability, Reliability and Security (AsiaARES). F...
This paper presents different approaches for real-time fault tolerance using redundancy methods for ...
This paper discusses various fault-tolerant task scheduling algorithms for the multi...
In this paper we are interested in mixed-criticality embedded applications implemented on distribut...