In this paper, we present a graph partitioning algorithm to partition graphs with trillions of edges. To achieve such scale, our solution leverages the vertex-centric Pregel abstraction provided by Giraph, a system for large-scale graph analytics. We designed our algorithm to compute partitions with high locality and fair balance, and focused on the characteristics necessary to reach wide adoption by practitioners in production. Our solution can (i) scale to massive graphs and thousands of compute cores, (ii) efficiently adapt partitions to changes to graphs and compute environments, and (iii) seamlessly integrate in existing systems without additional infrastructure. We evaluate our solution on the Facebook and Instagram graphs, as well as...
This dissertation addresses the problem of dynamic graph partitioning in a streaming manner in the c...
This dissertation addresses the problem of dynamic graph partitioning in a streaming manner in the c...
The last decade has seen an increased attention on large-scale data analysis, caused mainly by the a...
Several organizations, like social networks, store and routinely an-alyze large graphs as part of th...
As the study of large graphs over hundreds of gigabytes becomes increasingly popular for various dat...
As the study of large graphs over hundreds of gigabytes becomes increasingly popular for various dat...
As the study of large graphs over hundreds of gigabytes becomes increasingly popular for various dat...
As the size and variety of information networks continue to grow in many scientific and engineering ...
Large-scale graph-structured datasets are growing at an increasing rate. Social network graphs are a...
Large-scale graph-structured datasets are growing at an increasing rate. Social network graphs are a...
Distributed graph processing systems such as Pregel, PowerGraph, or GraphX have gained popularity du...
Balanced graph partitioning in the streaming setting is a key problem to enable scalable and efficie...
Balanced graph partitioning in the streaming setting is a key problem to enable scalable and efficie...
Balanced graph partitioning in the streaming setting is a key problem to enable scalable and efficie...
Balanced graph partitioning in the streaming setting is a key problem to enable scalable and efficie...
This dissertation addresses the problem of dynamic graph partitioning in a streaming manner in the c...
This dissertation addresses the problem of dynamic graph partitioning in a streaming manner in the c...
The last decade has seen an increased attention on large-scale data analysis, caused mainly by the a...
Several organizations, like social networks, store and routinely an-alyze large graphs as part of th...
As the study of large graphs over hundreds of gigabytes becomes increasingly popular for various dat...
As the study of large graphs over hundreds of gigabytes becomes increasingly popular for various dat...
As the study of large graphs over hundreds of gigabytes becomes increasingly popular for various dat...
As the size and variety of information networks continue to grow in many scientific and engineering ...
Large-scale graph-structured datasets are growing at an increasing rate. Social network graphs are a...
Large-scale graph-structured datasets are growing at an increasing rate. Social network graphs are a...
Distributed graph processing systems such as Pregel, PowerGraph, or GraphX have gained popularity du...
Balanced graph partitioning in the streaming setting is a key problem to enable scalable and efficie...
Balanced graph partitioning in the streaming setting is a key problem to enable scalable and efficie...
Balanced graph partitioning in the streaming setting is a key problem to enable scalable and efficie...
Balanced graph partitioning in the streaming setting is a key problem to enable scalable and efficie...
This dissertation addresses the problem of dynamic graph partitioning in a streaming manner in the c...
This dissertation addresses the problem of dynamic graph partitioning in a streaming manner in the c...
The last decade has seen an increased attention on large-scale data analysis, caused mainly by the a...