Abstract. We describe a replication-based protocol that uses group communication for fault tolerance in the Computational Grid. The Grid is partitioned into a number of clusters and each cluster has a designated coordinator that manages the states of the replicas within its cluster. The coordinators belong to a process group and the proposed protocol ensures the correct sequence of message deliveries to the replicas by the coordinators. Any failing node of the Grid is replaced by an active replica to provide correct continuation of the operation of the application. We show the theoretical framework along with illustrations of the replication protocol and its implementation results and analyze its performance and scalability.
We address the challenge of sharing large amounts of numerical data within computing grids consistin...
This paper introduces a fault-tolerant group communication protocol that is aimed at grid and wide a...
We address the challenge of sharing large amounts of numerical data within computing grids consistin...
4th International Symposium on Parallel and Distributed Processing and Applications, ISPA 2006; Sorr...
Grid of computing nodes has emerged as a representative means of connecting distributed computers or...
Grid of computing nodes has emerged as a representative means of connecting distributed computers or...
Abstract. We introduce FTRepMI, a simple fault-tolerant protocol for providing sequential consistenc...
This paper addresses the challenge of transparent data sharing within computing grids built as clust...
This paper addresses the challenge of transparent data sharing within computing grids built as clust...
International audienceAs high performance platforms (Clusters, Grids, etc.) continue to grow in size...
International audienceAs high performance platforms (Clusters, Grids, etc.) continue to grow in size...
A peer-to-peer grid computing is complicated by heterogeneous capabilities, failures, volatility, an...
International audienceAs high performance platforms (Clusters, Grids, etc.) continue to grow in size...
To appear/http://www.interscience.wiley.comThis paper addresses the challenge of transparent data sh...
To appear/http://www.interscience.wiley.comThis paper addresses the challenge of transparent data sh...
We address the challenge of sharing large amounts of numerical data within computing grids consistin...
This paper introduces a fault-tolerant group communication protocol that is aimed at grid and wide a...
We address the challenge of sharing large amounts of numerical data within computing grids consistin...
4th International Symposium on Parallel and Distributed Processing and Applications, ISPA 2006; Sorr...
Grid of computing nodes has emerged as a representative means of connecting distributed computers or...
Grid of computing nodes has emerged as a representative means of connecting distributed computers or...
Abstract. We introduce FTRepMI, a simple fault-tolerant protocol for providing sequential consistenc...
This paper addresses the challenge of transparent data sharing within computing grids built as clust...
This paper addresses the challenge of transparent data sharing within computing grids built as clust...
International audienceAs high performance platforms (Clusters, Grids, etc.) continue to grow in size...
International audienceAs high performance platforms (Clusters, Grids, etc.) continue to grow in size...
A peer-to-peer grid computing is complicated by heterogeneous capabilities, failures, volatility, an...
International audienceAs high performance platforms (Clusters, Grids, etc.) continue to grow in size...
To appear/http://www.interscience.wiley.comThis paper addresses the challenge of transparent data sh...
To appear/http://www.interscience.wiley.comThis paper addresses the challenge of transparent data sh...
We address the challenge of sharing large amounts of numerical data within computing grids consistin...
This paper introduces a fault-tolerant group communication protocol that is aimed at grid and wide a...
We address the challenge of sharing large amounts of numerical data within computing grids consistin...