The graph partitioning strategy plays a vital role in the overall execution of an algorithm in a distributed graph processing system. Choosing the best strategy is very challenging, as no one strategy is always the best fit for all kinds of graphs or algorithms. In this paper, we help users choosing a suitable partitioning strategy for algorithms based on the Pregel model by providing a cost model for the Pregel implementation in Spark-GraphX. The cost model shows the relationship between four major parameters: (1) input graph (2) cluster configuration (3) algorithm properties and (4) partitioning strategy. We validate the accuracy of the cost model on 17 different combinations of input graph, algorithm, and partition strategy. As such, the...
Even if Pregel scales better than MapReduce in graph processing by reducing iteration's disk I/O, wh...
As the study of large graphs over hundreds of gigabytes becomes increasingly popular for various dat...
International audienceGraph processing is an emerging computation model for a wide range of applicat...
The graph partitioning strategy plays a vital role in the overall execution of an algorithm in a dis...
The graph partitioning strategy plays a vital role in the overall execution of an algorithm in a dis...
We study the problem of implementing graph algorithms efficiently on Pregel-like systems, which can ...
In this age of information, data gathering has become a new growing trend. Social networking sites, ...
In this paper, we study the problem of choosing among partitioning strategies in distributed graph p...
In this thesis, we study the problem of choosing among partitioning strategies in distributed graph ...
Graphs in real life applications are often huge, such as the Web graph and various social networks. ...
Distributed graph processing systems such as Pregel, PowerGraph, or GraphX have gained popularity du...
Although using graphs to represent networks and relationship is not new; the size of network has bee...
Distributed vertex-centric graph processing systems such as Pregel, Giraph and GPS have acquired sig...
© 1989-2012 IEEE. Hypergraphs are generalizations of graphs where the (hyper)edges can connect any n...
Graph partitioning is considered to be a standard solution to process huge graphs efficiently when p...
Even if Pregel scales better than MapReduce in graph processing by reducing iteration's disk I/O, wh...
As the study of large graphs over hundreds of gigabytes becomes increasingly popular for various dat...
International audienceGraph processing is an emerging computation model for a wide range of applicat...
The graph partitioning strategy plays a vital role in the overall execution of an algorithm in a dis...
The graph partitioning strategy plays a vital role in the overall execution of an algorithm in a dis...
We study the problem of implementing graph algorithms efficiently on Pregel-like systems, which can ...
In this age of information, data gathering has become a new growing trend. Social networking sites, ...
In this paper, we study the problem of choosing among partitioning strategies in distributed graph p...
In this thesis, we study the problem of choosing among partitioning strategies in distributed graph ...
Graphs in real life applications are often huge, such as the Web graph and various social networks. ...
Distributed graph processing systems such as Pregel, PowerGraph, or GraphX have gained popularity du...
Although using graphs to represent networks and relationship is not new; the size of network has bee...
Distributed vertex-centric graph processing systems such as Pregel, Giraph and GPS have acquired sig...
© 1989-2012 IEEE. Hypergraphs are generalizations of graphs where the (hyper)edges can connect any n...
Graph partitioning is considered to be a standard solution to process huge graphs efficiently when p...
Even if Pregel scales better than MapReduce in graph processing by reducing iteration's disk I/O, wh...
As the study of large graphs over hundreds of gigabytes becomes increasingly popular for various dat...
International audienceGraph processing is an emerging computation model for a wide range of applicat...