We consider divisible load scientific applications executing on large-scale platforms subject to silent errors. While the goal is usually to complete the execution as fast as possible in expectation, another major concern is energy consumption. The use of dynamic voltage and frequency scaling (DVFS) can help save energy, but at the price of performance degradation. Consider the execution model where a set of K different speeds is given, and whenever a failure occurs, a different re-execution speed may be used. Can this help? We address the following bi-criteria problem: how to compute the optimal checkpointing period to minimize energy consumption while bounding the degradation in performance. We solve this bi-criteria...
In this paper, we aim at minimizing the energy consumption when executing a di...
This paper presents several energy-aware scheduling algorithms whose design is...
This short paper deals with parallel scientific applications using non-blocking and periodic coordin...
In this paper, we combine the traditional checkpointing and rollback recovery ...
This thesis deals with two issues for future Exascale platforms, namely resilience and energy. We ad...
This paper investigates the optimal number of processors to execute a parallel...