In this thesis, we propose optimization techniques for distributed graph processing. First, we describe a data processing pipeline that leverages an iterative graph algorithm for automatic classification of web trackers. Using this application as a motivating example, we examine how asymmetrical convergence of iterative graph algorithms can be used to reduce the amount of computation and communication in large-scale graph analysis. We propose an optimization framework for fixpoint algorithms and a declarative API for writing fixpoint applications. Our framework uses a cost model to automatically exploit asymmetrical convergence and evaluate execution strategies during runtime. We show that our cost model achieves speedup of up to 1.7x and c...
Thinking Like A Vertex (TLAV) is a popular computational paradigm suitable to express many distribut...
The interest in the ability of processing data that has an underlying graph structure is grown in th...
While various iterative graph algorithms can be expressed via asynchronous parallelism, lack of its ...
In this thesis, we propose optimization techniques for distributed graph processing. First, we descr...
Distributed, shared-nothing architectures of commodity machines are a popular design choice for the ...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
The last decade has seen an increased attention on large-scale data analysis, caused mainly by the a...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
The amount of data generated every day is growing exponentially in the big data era. A significant p...
Most data in today's world can be represented in a graph form, and these graphs can then be used as ...
A graph is a ubiquitous data structure that models entities and their interactions through the colle...
The world is becoming a more conjunct place and the number of data sources such as social networks, ...
The world is becoming a more conjunct place and the number of data sources such as social networks, ...
Graph processing is increasingly popular in a variety of scientific and engineering domains. Consequ...
To efficiently process time-evolving graphs where new vertices and edges are inserted over time, an ...
Thinking Like A Vertex (TLAV) is a popular computational paradigm suitable to express many distribut...
The interest in the ability of processing data that has an underlying graph structure is grown in th...
While various iterative graph algorithms can be expressed via asynchronous parallelism, lack of its ...
In this thesis, we propose optimization techniques for distributed graph processing. First, we descr...
Distributed, shared-nothing architectures of commodity machines are a popular design choice for the ...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
The last decade has seen an increased attention on large-scale data analysis, caused mainly by the a...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
The amount of data generated every day is growing exponentially in the big data era. A significant p...
Most data in today's world can be represented in a graph form, and these graphs can then be used as ...
A graph is a ubiquitous data structure that models entities and their interactions through the colle...
The world is becoming a more conjunct place and the number of data sources such as social networks, ...
The world is becoming a more conjunct place and the number of data sources such as social networks, ...
Graph processing is increasingly popular in a variety of scientific and engineering domains. Consequ...
To efficiently process time-evolving graphs where new vertices and edges are inserted over time, an ...
Thinking Like A Vertex (TLAV) is a popular computational paradigm suitable to express many distribut...
The interest in the ability of processing data that has an underlying graph structure is grown in th...
While various iterative graph algorithms can be expressed via asynchronous parallelism, lack of its ...