Abstract: MapReduce is often used for critical data processing, e.g., in scientific or financial simulation. However, there is evidence in the literature that arbitrary (or Byzantine) faults may corrupt the results of MapReduce without being detected. We present a Byzantine fault-tolerant MapReduce framework that can run in two modes: nonspeculative and speculative. We thoroughly evaluate the performance of both modes experimentally, showing that they use around twice the resources of Hadoop MapReduce, rather than the three times required by alternative solutions. We believe this cost is acceptable for many critical applications.
Index Terms: Hadoop, MapReduce, Byzantine fault tolerance
Part 1: Distributed Protocols. The overall performance improvement in Byzantine ...
In this paper, we argue for the need for and benefits of providing Byzantine fault tolerance as a servi...
All practical applications contain some degree of nondeterminism. When such applications are replica...
MapReduce is often used to run critical jobs such as scientific data analysis. However, evidence in ...
Abstract—MapReduce is a framework for processing large data sets largely used in cloud computing. Ma...
Master's thesis in Informatics, presented to the Universidade de Lisboa, through the Faculdade de Ciênc...
Byzantine fault tolerance has been intensively studied over the past decade as a way to enhance the ...
Practically, Byzantine faults are not assumed in cloud applications. Byzantine fault-tolerance adds ...
Byzantine fault-tolerance techniques are useful because they tolerate arbitrary faults regardle...
The popularity of the MapReduce programming model has increased interest in the research community for i...
Cloud computing has emerged as a popular paradigm that enables the establishment of large scale, flexi...
Byzantine faults in distributed systems can have very destructive consequences for services built on...
The primary concern of traditional Byzantine fault tolerance is to ensure strong replica consistency...
Since its introduction nearly 20 years ago, the Byzantine Generals Problem has been the subject of m...