Abstract. With the maturity of the Grid, the community has made an important effort in developing middleware and tools to provide differ-ent functionalities, such as resource discovery, resource management, job submission, execution monitoring. Unfortunately, not so much attention has been paid in developing mechanism to provide fault-tolerance to ex-ecution of applications on a Grid. This paper addresses the design and implementation of an architecture based on services (CPPC-G) to man-age the execution of fault tolerant parallel applications on Grids. The CPPC (Controller/Precompiler for Portable Checkpointing) framework is used to insert checkpoint instrumentation into the application code. Designed services will be in charge of submissi...
Abstract- In grid computing, resources are used outside the boundary of organizations and it becomes...
This paper introduces a novel approach in parallel checkpointing aimed at supporting fault-tolerance...
Enabling applications for computational Grids requires new approaches to develop applications that c...
Abstract. The Grid community has made an important effort in developing middleware to provide differ...
Jobs in Grid workflows are exposed to different types of failure. It is important to develop fault t...
InteGrade is a grid middleware infrastructure that enables the use of idle computing power from user...
compiler for Portable Checkpointing), a checkpointing tool designed for heterogeneous clusters and G...
Despite the increasing popularity of shared-memory systems, there is a lack of tools for providing f...
Abstract—The GridRPC model is well suited for high per-formance computing on grids thanks to efficie...
The ability to tolerate failures while effectively exploiting the grid computing resources in an sca...
Grid computing uses massive power of idle cycles of PC's.Desktop grids is nothing using the idl...
The EU-funded XtreemOS project implements a grid operating system (OS) transparently exploiting dist...
With the evolution of high-performance computing towards heterogeneous, massively par-allel systems,...
The paper describes a parallel program checkpointing mechanism and its potential application in Grid...
The Grid environment is generic, heterogeneous, and dynamic with lots of unreliable resources making...
Abstract- In grid computing, resources are used outside the boundary of organizations and it becomes...
This paper introduces a novel approach in parallel checkpointing aimed at supporting fault-tolerance...
Enabling applications for computational Grids requires new approaches to develop applications that c...
Abstract. The Grid community has made an important effort in developing middleware to provide differ...
Jobs in Grid workflows are exposed to different types of failure. It is important to develop fault t...
InteGrade is a grid middleware infrastructure that enables the use of idle computing power from user...
compiler for Portable Checkpointing), a checkpointing tool designed for heterogeneous clusters and G...
Despite the increasing popularity of shared-memory systems, there is a lack of tools for providing f...
Abstract—The GridRPC model is well suited for high per-formance computing on grids thanks to efficie...
The ability to tolerate failures while effectively exploiting the grid computing resources in an sca...
Grid computing uses massive power of idle cycles of PC's.Desktop grids is nothing using the idl...
The EU-funded XtreemOS project implements a grid operating system (OS) transparently exploiting dist...
With the evolution of high-performance computing towards heterogeneous, massively par-allel systems,...
The paper describes a parallel program checkpointing mechanism and its potential application in Grid...
The Grid environment is generic, heterogeneous, and dynamic with lots of unreliable resources making...
Abstract- In grid computing, resources are used outside the boundary of organizations and it becomes...
This paper introduces a novel approach in parallel checkpointing aimed at supporting fault-tolerance...
Enabling applications for computational Grids requires new approaches to develop applications that c...