Enabling applications for computational Grids requires new approaches to develop applications that can effectively cope with resource volatility. Applications must be resilient to resource faults, adapting the behaviour to available resources. This paper describes an approach to application-level adaptation that efficiently supports application-level checkpointing. The key of this work is the concept of pluggable parallelisation, which localises parallelisation issues into multiple modules that can be (un)plugged to match resource availability. This paper shows how pluggable parallelisation can be extended to effectively support checkpointing and run-time adaptation. We present the developed pluggable mechanism that helps the programmer to ...
A grid is a distributed computational and storage environment often composed of heterogeneous autono...
International audienceIn large-scale Grid computing environments, providing fault-tolerance is requi...
High Performance Computing (HPC) systems represent the peak of modern computational capability. As ...
Migrating traditional scientific applications to computational Grids requires programming tools that...
Migrating traditional scientific applications to computational Grids requires programming tools that...
Migrating traditional scientific applications to computational Grids requires programming tools that...
Jobs in Grid workflows are exposed to different types of failure. It is important to develop fault t...
The Grid environment is generic, heterogeneous, and dynamic with lots of unreliable resources making...
When executing highly demanding parallel/distributed applications on grids or non-dedicated parallel...
Abstract — Checkpointing is a typical approach to tolerate failures in today’s supercomputing cluste...
As grids typically consist of autonomously managed subsystems with strongly varying resources, fault...
By leveraging the enormous amount of computational capabilities, scientists today are being able to ...
International audienceThe emergence of dynamic execution environments such as Grids forces applicati...
InteGrade is a grid middleware infrastructure that enables the use of idle computing power from user...
The emergence of dynamic execution environments such as Grids forces applications to take dynamicity...
A grid is a distributed computational and storage environment often composed of heterogeneous autono...
International audienceIn large-scale Grid computing environments, providing fault-tolerance is requi...
High Performance Computing (HPC) systems represent the peak of modern computational capability. As ...
Migrating traditional scientific applications to computational Grids requires programming tools that...
Migrating traditional scientific applications to computational Grids requires programming tools that...
Migrating traditional scientific applications to computational Grids requires programming tools that...
Jobs in Grid workflows are exposed to different types of failure. It is important to develop fault t...
The Grid environment is generic, heterogeneous, and dynamic with lots of unreliable resources making...
When executing highly demanding parallel/distributed applications on grids or non-dedicated parallel...
Abstract — Checkpointing is a typical approach to tolerate failures in today’s supercomputing cluste...
As grids typically consist of autonomously managed subsystems with strongly varying resources, fault...
By leveraging the enormous amount of computational capabilities, scientists today are being able to ...
International audienceThe emergence of dynamic execution environments such as Grids forces applicati...
InteGrade is a grid middleware infrastructure that enables the use of idle computing power from user...
The emergence of dynamic execution environments such as Grids forces applications to take dynamicity...
A grid is a distributed computational and storage environment often composed of heterogeneous autono...
International audienceIn large-scale Grid computing environments, providing fault-tolerance is requi...
High Performance Computing (HPC) systems represent the peak of modern computational capability. As ...