In this paper, we propose a scalable and fault-tolerant job scheduling framework for grid computing. The framework loosely couples a dynamic job scheduling approach with a hybrid replication approach to schedule jobs efficiently while providing fault tolerance. Its novelty is that it uses passive replication under high system load and active replication under low system load. The switch between the two replication methods is made dynamically and transparently.
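The load-dependent switch described above can be illustrated with a minimal Python sketch. It is not the framework's actual implementation: the names `ReplicationMode`, `choose_mode`, and `replicas_to_run`, the normalized load scale, and the 0.7 threshold are all assumptions made for illustration. The sketch also assumes the usual meanings of the two modes, namely that active replication runs a job on every replica at once, while passive replication runs it on a primary and keeps the other replicas as standbys.

```python
from enum import Enum


class ReplicationMode(Enum):
    ACTIVE = "active"    # every replica executes the job concurrently
    PASSIVE = "passive"  # only a primary executes; backups take over on failure


def choose_mode(system_load: float, high_load_threshold: float = 0.7) -> ReplicationMode:
    """Pick a replication mode from the current load.

    system_load is assumed to be normalized: 0.0 = idle, 1.0 = saturated.
    The threshold value is a hypothetical default, not taken from the paper.
    """
    if not 0.0 <= system_load <= 1.0:
        raise ValueError("system_load must be in [0.0, 1.0]")
    if system_load >= high_load_threshold:
        # High load: avoid spending scarce resources on redundant executions.
        return ReplicationMode.PASSIVE
    # Low load: spare capacity can be used to run all replicas at once.
    return ReplicationMode.ACTIVE


def replicas_to_run(mode: ReplicationMode, replica_count: int) -> int:
    """Number of replicas that actually execute the job under the given mode."""
    if replica_count < 1:
        raise ValueError("replica_count must be at least 1")
    return replica_count if mode is ReplicationMode.ACTIVE else 1
```

Because the mode is recomputed from the current load each time a job is scheduled, the caller never selects a mode explicitly, which is one simple way to make the switch transparent to the job submitter.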
In this paper we propose a fault-tolerant scheduler for Bag-of-Tasks Grid applications, called WorkQ...
Computational grids have the potential for solving scientific and large-scale problems using heter...
10.1145/1272366.1272409 Proceedings of the 16th International Symposium on High Performance Distribut...
Recent trends in grid computing development are moving towards a service-oriented architecture. With ...
One of the primary issues associated with the efficient and effective utilization of distributed com...
A grid is a distributed computational and storage environment often composed of heterogeneous autono...
As grids typically consist of autonomously managed subsystems with strongly varying resources, fault...
Grid computing is an emerging technology which has the potential to solve large scale scien...
As grids typically consist of heterogeneously managed subsystems with strongly varying resources, re...
Grid computing is an effective distributed and adaptable processing network that manages a huge numb...
Workflow brokers of existing Grid Scheduling Systems lack a cooperation mechanism, which causes ...
In grid computing, resources are used outside the boundary of organizations and it becomes...
Job checkpointing is one of the most commonly used techniques for providing fault tolerance in com...
One of the main problems in distributed high-performance computing is how to allocate, schedule, ef...
While fault-tolerance is desirable for grid applications because of the distributed and dynamic natu...