Designing distributed graph systems has drawn a lot of research interests due to the strong expressiveness of the graph model and rapidly increasing graph volume. Most of them require the graph data and all intermediate messages to reside in main memory, which may sacrifice the scalability. Even though several disk-based systems have been studied to remedy such issue, several challenges still exist in achieving both high computational efficiency and low network communication under the limitation of memory usage. In this paper, we design a novel disk-based distributed graph system, called ScaleG. The system provides a series of user-friendly programming interfaces. Unlike previous systems, the programmer in ScaleG does not need to concern an...
Computing connected components is a core operation on graph data. Since billion-scale graphs cannot ...
Even if Pregel scales better than MapReduce in graph processing by reducing iteration's disk I/O, wh...
The importance of high-performance graph processing to solve big data problems targeting high-impact...
The world is becoming a more conjunct place and the number of data sources such as social networks, ...
The world is becoming a more conjunct place and the number of data sources such as social networks, ...
Abstract—Graph algorithms on distributed-memory systems typically perform heavy communication, often...
Graphs have become increasingly important to represent highly-interconnected structures and schema-l...
Graph processing is increasingly popular in a variety of scientific and engineering domains. Consequ...
Current systems for graph computation require a distributed computing cluster to handle very large r...
We are witnessing an enormous growth in social networks as well as in the volume of data generated b...
With the prevalence of graph data in real-world applications (e.g., social networks, mobile phone ne...
Existing distributed graph processing frameworks, e.g., Pregel, Gi-raph, GPS and GraphLab, mainly ex...
The amount of data generated every day is growing exponentially in the big data era. A significant p...
Abstract—Small distributed systems are limited by their main memory to generate massively large grap...
The last decade has seen an increased attention on large-scale data analysis, caused mainly by the a...
Computing connected components is a core operation on graph data. Since billion-scale graphs cannot ...
Even if Pregel scales better than MapReduce in graph processing by reducing iteration's disk I/O, wh...
The importance of high-performance graph processing to solve big data problems targeting high-impact...
The world is becoming a more conjunct place and the number of data sources such as social networks, ...
The world is becoming a more conjunct place and the number of data sources such as social networks, ...
Abstract—Graph algorithms on distributed-memory systems typically perform heavy communication, often...
Graphs have become increasingly important to represent highly-interconnected structures and schema-l...
Graph processing is increasingly popular in a variety of scientific and engineering domains. Consequ...
Current systems for graph computation require a distributed computing cluster to handle very large r...
We are witnessing an enormous growth in social networks as well as in the volume of data generated b...
With the prevalence of graph data in real-world applications (e.g., social networks, mobile phone ne...
Existing distributed graph processing frameworks, e.g., Pregel, Gi-raph, GPS and GraphLab, mainly ex...
The amount of data generated every day is growing exponentially in the big data era. A significant p...
Abstract—Small distributed systems are limited by their main memory to generate massively large grap...
The last decade has seen an increased attention on large-scale data analysis, caused mainly by the a...
Computing connected components is a core operation on graph data. Since billion-scale graphs cannot ...
Even if Pregel scales better than MapReduce in graph processing by reducing iteration's disk I/O, wh...
The importance of high-performance graph processing to solve big data problems targeting high-impact...