In this thesis, we propose optimization techniques for distributed graph processing. First, we describe a data processing pipeline that leverages an iterative graph algorithm for automatic classification of web trackers. Using this application as a motivating example, we examine how asymmetrical convergence of iterative graph algorithms can be used to reduce the amount of computation and communication in large-scale graph analysis. We propose an optimization framework for fixpoint algorithms and a declarative API for writing fixpoint applications. Our framework uses a cost model to automatically exploit asymmetrical convergence and evaluate execution strategies during runtime. We show that our cost model achieves speedup of up to 1.7x and c...
The interest in the ability of processing data that has an underlying graph structure is grown in th...
Thinking Like A Vertex (TLAV) is a popular computational paradigm suitable to express many distribut...
Efficiently processing large graphs is challenging, since parallel graph algorithms suffer from poor...
In this thesis, we propose optimization techniques for distributed graph processing. First, we descr...
Distributed, shared-nothing architectures of commodity machines are a popular design choice for the ...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
The last decade has seen an increased attention on large-scale data analysis, caused mainly by the a...
The amount of data generated every day is growing exponentially in the big data era. A significant p...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
Most data in today's world can be represented in a graph form, and these graphs can then be used as ...
A graph is a ubiquitous data structure that models entities and their interactions through the colle...
The world is becoming a more conjunct place and the number of data sources such as social networks, ...
The world is becoming a more conjunct place and the number of data sources such as social networks, ...
Graph processing is increasingly popular in a variety of scientific and engineering domains. Consequ...
To efficiently process time-evolving graphs where new vertices and edges are inserted over time, an ...
The interest in the ability of processing data that has an underlying graph structure is grown in th...
Thinking Like A Vertex (TLAV) is a popular computational paradigm suitable to express many distribut...
Efficiently processing large graphs is challenging, since parallel graph algorithms suffer from poor...
In this thesis, we propose optimization techniques for distributed graph processing. First, we descr...
Distributed, shared-nothing architectures of commodity machines are a popular design choice for the ...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
The last decade has seen an increased attention on large-scale data analysis, caused mainly by the a...
The amount of data generated every day is growing exponentially in the big data era. A significant p...
Both researchers and industry are confronted with the need to process increasingly large amounts of ...
Most data in today's world can be represented in a graph form, and these graphs can then be used as ...
A graph is a ubiquitous data structure that models entities and their interactions through the colle...
The world is becoming a more conjunct place and the number of data sources such as social networks, ...
The world is becoming a more conjunct place and the number of data sources such as social networks, ...
Graph processing is increasingly popular in a variety of scientific and engineering domains. Consequ...
To efficiently process time-evolving graphs where new vertices and edges are inserted over time, an ...
The interest in the ability of processing data that has an underlying graph structure is grown in th...
Thinking Like A Vertex (TLAV) is a popular computational paradigm suitable to express many distribut...
Efficiently processing large graphs is challenging, since parallel graph algorithms suffer from poor...