On Fault Tolerance for Distributed Iterative Dataflow Processing

Xu, Chen
Holzemer, Markus
Kaul, Manohar
Soto, Juan et. al.

Open link

Publication date

January 2017

DOI

10.1109/TKDE.2017.2690431

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

ISSN

1041-4347

Citation count (estimate)

Abstract

Large-scale graph and machine learning analytics widely employ distributed iterative processing. Typically, these analytics are a part of a comprehensive workflow, which includes data preparation, model building, and model evaluation. General-purpose distributed dataflow frameworks execute all steps of such workflows holistically. This holistic view enables these systems to reason about and automatically optimize the entire pipeline. Here, graph and machine learning analytics are known to incur a long runtime since they require multiple passes over the data until convergence is reached. Thus, fault tolerance and a fast-recovery from any intermittent failure is critical for efficient analysis. In this paper, we propose novel fault-tolerant m...

Extracted data

We use cookies to provide a better user experience.

Data Protection

On Fault Tolerance for Distributed Iterative Dataflow Processing

Abstract

Extracted data

On Fault Tolerance for Distributed Iterative Dataflow Processing

Abstract

Extracted data

Related items

Related items