Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative nature of many analysis and machine learning algorithms, however, is still a challenge for current systems. While certain types of bulk iterative algorithms are supported by novel dataflow frameworks, these systems cannot exploit compu-tational dependencies present in many algorithms, such as graph algorithms. As a result, these algorithms are inefficiently executed and have led to specialized systems based on other paradigms, such as message passing or shared memory. We propose a method to integrate incremental iterations, a form of workset iterations, with parallel dataflows. After showing how to integrate bulk iterations into a dataflow sy...
In the foreseeable future, high-performance supercomputers will continue to evolve in the direction ...
International audienceDynamic workflows are scientific workflows to support computational science si...
<p>Many modern machine learning (ML) algorithms are iterative, converging on a final solution via ma...
Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative ...
This work addresses the need for stateful dataflow programs that can rapidly sift through huge, evol...
The term "dataflow" generally encompasses three distinct aspects of computation - a data-driven mode...
Large-scale graph and machine learning analytics widely employ distributed iterative processing. Typ...
In this thesis, we address the problem of efficiently and automatically scaling iterative computatio...
This is an extended version of Modeling Big Data Processing Programs, by Joao Batista de Souza Neto,...
International audienceThis paper proposes a model for specifying data flow-based parallel data proc...
The iterative algorithm is widely used to solve instances of data-flow analysis problems. The algori...
textThe unprecedented and exponential growth of data along with the advent of multi-core processors...
Real-world graph processing applications often require combining the graph data with tabular data. M...
The dataflow model of computation exposes and exploits parallelism in programs without requiring p...
Cloud intelligence applications often perform iterative computa-tions (e.g., PageRank) on constantly...
In the foreseeable future, high-performance supercomputers will continue to evolve in the direction ...
International audienceDynamic workflows are scientific workflows to support computational science si...
<p>Many modern machine learning (ML) algorithms are iterative, converging on a final solution via ma...
Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative ...
This work addresses the need for stateful dataflow programs that can rapidly sift through huge, evol...
The term "dataflow" generally encompasses three distinct aspects of computation - a data-driven mode...
Large-scale graph and machine learning analytics widely employ distributed iterative processing. Typ...
In this thesis, we address the problem of efficiently and automatically scaling iterative computatio...
This is an extended version of Modeling Big Data Processing Programs, by Joao Batista de Souza Neto,...
International audienceThis paper proposes a model for specifying data flow-based parallel data proc...
The iterative algorithm is widely used to solve instances of data-flow analysis problems. The algori...
textThe unprecedented and exponential growth of data along with the advent of multi-core processors...
Real-world graph processing applications often require combining the graph data with tabular data. M...
The dataflow model of computation exposes and exploits parallelism in programs without requiring p...
Cloud intelligence applications often perform iterative computa-tions (e.g., PageRank) on constantly...
In the foreseeable future, high-performance supercomputers will continue to evolve in the direction ...
International audienceDynamic workflows are scientific workflows to support computational science si...
<p>Many modern machine learning (ML) algorithms are iterative, converging on a final solution via ma...