Abstract—Myriad of graph-based algorithms in machine learning and data mining require parsing relational data iteratively. These algorithms are implemented in a large-scale distributed environment in order to scale to massive data sets. To accelerate these large-scale graph-based iterative computations, we propose delta-based accumulative iterative computation (DAIC). Different from traditional iterative computations, which iteratively update the result based on the result from the previous iteration, DAIC updates the result by accumulating the “changes ” between iterations. By DAIC, we can process only the “changes ” to avoid the negligible updates. Furthermore, we can perform DAIC asynchronously to bypass the high-cost synchronous barrier...
While high-level data parallel frameworks, like MapReduce, sim-plify the design and implementation o...
Whenever the term “Big Data” was mentioned, it was often closely associated with technologies like A...
While high-level data parallel frameworks, like MapReduce, sim-plify the design and implementation o...
Abstract—This supplementary file contains the supporting materials of the TPDS manuscript “Maiter: ...
Abstract—Iterative computations are pervasive among data analysis applications, including Web search...
In today’s Web and social network environments, query workloads include ad hoc and OLAP queries, as ...
While various iterative graph algorithms can be expressed via asynchronous parallelism, lack of its ...
In today’s Web and social network environments, query workloads include ad hoc and OLAP queries, as ...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
In this thesis, we propose optimization techniques for distributed graph processing. First, we descr...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
Future High Performance Computing (HPC) nodes will have many more processors than the contemporary a...
To efficiently process time-evolving graphs where new vertices and edges are inserted over time, an ...
Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative ...
Abstract It is true that data is never static; it keeps growing and changing over time. New data is ...
While high-level data parallel frameworks, like MapReduce, sim-plify the design and implementation o...
Whenever the term “Big Data” was mentioned, it was often closely associated with technologies like A...
While high-level data parallel frameworks, like MapReduce, sim-plify the design and implementation o...
Abstract—This supplementary file contains the supporting materials of the TPDS manuscript “Maiter: ...
Abstract—Iterative computations are pervasive among data analysis applications, including Web search...
In today’s Web and social network environments, query workloads include ad hoc and OLAP queries, as ...
While various iterative graph algorithms can be expressed via asynchronous parallelism, lack of its ...
In today’s Web and social network environments, query workloads include ad hoc and OLAP queries, as ...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
In this thesis, we propose optimization techniques for distributed graph processing. First, we descr...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
Future High Performance Computing (HPC) nodes will have many more processors than the contemporary a...
To efficiently process time-evolving graphs where new vertices and edges are inserted over time, an ...
Parallel dataflow systems are a central part of most analytic pipelines for big data. The iterative ...
Abstract It is true that data is never static; it keeps growing and changing over time. New data is ...
While high-level data parallel frameworks, like MapReduce, sim-plify the design and implementation o...
Whenever the term “Big Data” was mentioned, it was often closely associated with technologies like A...
While high-level data parallel frameworks, like MapReduce, sim-plify the design and implementation o...