In this thesis, we address the problem of efficiently and automatically scaling iterative computational applications through parallel programming frameworks. While there has been much progress in designing and developing parallel platforms with high level programming paradigms for batch-oriented applications, these platforms are ill-fitted for iterative computations due to their ignorance of resident data and enforcement of "embarrassingly parallel" batch-style processing of data sets within every computational operators. To address these challenges we propose a set of methods that leverage certain properties of iterative computations to enhance the performance of the resulting parallel programs for these large-scale iterative applications....
Recent advances in sensing, storage, and networking technologies are creating massive amounts of dat...
Increased programmability for concurrent applications in distributed systems requires automatic supp...
International audienceThis paper presents many typical problems that are encountered when executing ...
Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative ...
Parallel computing hardware is ubiquitous, ranging from cell-phones with multiple cores to super-com...
Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative ...
Many machine learning algorithms iteratively process datapoints and transform global model parameter...
In parallel object-oriented languages it is hard to elegantly express efficient data-parallel operat...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Parallelization of sequential programs is often daunting because of the substantial development cost...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
We present dynamic control replication, a run-time program analysis that enables scalable execution ...
Many large-scale machine learning (ML) applications use it-erative algorithms to converge on paramet...
International audienceComputing in parallel means performing computation simultaneously, this genera...
Many large-scale machine learning (ML) applications use it-erative algorithms to converge on paramet...
Recent advances in sensing, storage, and networking technologies are creating massive amounts of dat...
Increased programmability for concurrent applications in distributed systems requires automatic supp...
International audienceThis paper presents many typical problems that are encountered when executing ...
Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative ...
Parallel computing hardware is ubiquitous, ranging from cell-phones with multiple cores to super-com...
Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative ...
Many machine learning algorithms iteratively process datapoints and transform global model parameter...
In parallel object-oriented languages it is hard to elegantly express efficient data-parallel operat...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Parallelization of sequential programs is often daunting because of the substantial development cost...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
We present dynamic control replication, a run-time program analysis that enables scalable execution ...
Many large-scale machine learning (ML) applications use it-erative algorithms to converge on paramet...
International audienceComputing in parallel means performing computation simultaneously, this genera...
Many large-scale machine learning (ML) applications use it-erative algorithms to converge on paramet...
Recent advances in sensing, storage, and networking technologies are creating massive amounts of dat...
Increased programmability for concurrent applications in distributed systems requires automatic supp...
International audienceThis paper presents many typical problems that are encountered when executing ...