Abstract. To achieve correct execution of peer-to-peer applications on unreliable resources, we present a portable, distributed algorithm that provides fault tolerance and result checking. Two kinds of faults are considered: node failure or disconnection, and result forgery. The algorithm is based on knowledge of the macro data-flow dependencies between the application tasks. It provides correct execution with respect to a probabilistic certificate. We have implemented it on top of the Athapascan programming interface, and experimental results are presented.
A framework for modelling and formal verification of distributed algorithms developed by Reisig et a...
As today's distributed applications increase in complexity, it becomes increasingly difficult to ...
Abstract—P2P computing platforms are subject to a wide range of attacks. In this paper, we propose a...
Abstract. The partitioning of a long running task into smaller tasks that are executed separately in...
Distributed systems and extreme-scale systems are ubiquitous in recent years and have seen throughou...
Large scale cluster, Peer-to-Peer computing systems and grid computer systems gather thousands of no...
Abstract—Fault tolerance issues related to the implementation of distributed iterative algorithms vi...
Ubiquitous connectivity of web, mobile, and IoT computing platforms has fostered a variety of distri...
It is argued that there is a significant class of pipelined large grain data flow computations whose...
This work presents a new comparison-based diagnosis model and a new algorithm, called Hi-Dif, based ...
Distributed computing is inherently based on replication, promising increased tolerance to failures...
Handling faults is a growing concern in HPC; higher error rates, larger detection intervals and sile...
Maintaining performance in a faulty distributed computing environment is a major challenge in the de...