On a GPU cluster, the ratio of high computing power to communication bandwidth makes scaling breadth-first search (BFS) on a scale-free graph extremely challenging. By separating high and low out-degree vertices, we present an implementation with scalable computation and a model for scalable communication for BFS and direction-optimized BFS. Our communication model uses global reduction for high-degree vertices, and point-to-point transmission for low-degree vertices. Leveraging the characteristics of degree separation, we reduce the graph size to one third of the conventional edge list representation. With several other optimizations, we observe linear weak scaling as we increase the number of GPUs, and achieve 259.8 GTEPS on a scale-33 Gr...
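The degree-separated communication described above can be illustrated with a minimal single-process sketch. It is not the authors' implementation: the function names, the `threshold` parameter, and the modulo `owner` mapping are all assumptions made for illustration, and the "global reduction" and "point-to-point" steps are only simulated with a set and per-rank message buckets.

```python
from collections import defaultdict


def separate_by_degree(adj, threshold):
    """Split the vertices that have adjacency lists into high/low out-degree sets.

    `threshold` is a hypothetical cutoff; the abstract does not state one.
    """
    high = {v for v, nbrs in adj.items() if len(nbrs) >= threshold}
    low = set(adj) - high
    return high, low


def bfs_level(adj, frontier, visited, high, owner):
    """Expand one BFS level and route discoveries by degree class.

    Returns (high_hits, messages):
      high_hits - newly reached high-degree vertices, which a distributed
                  version would merge with a global reduction;
      messages  - destination rank -> newly reached low-degree vertices,
                  which would be sent point-to-point.
    """
    high_hits = set()
    messages = defaultdict(set)
    for u in frontier:
        for v in adj.get(u, ()):
            if v in visited:
                continue
            if v in high:
                high_hits.add(v)
            else:
                messages[owner(v)].add(v)
    return high_hits, messages


def bfs(adj, source, threshold, num_ranks):
    """Level-synchronous BFS that simulates both communication paths locally."""
    high, _ = separate_by_degree(adj, threshold)
    owner = lambda v: v % num_ranks  # assumed 1D vertex-to-rank mapping
    depth = {source: 0}
    frontier = {source}
    level = 0
    while frontier:
        level += 1
        high_hits, messages = bfs_level(adj, frontier, depth, high, owner)
        # Simulated "reduction" result plus all simulated point-to-point messages.
        next_frontier = set(high_hits)
        for vertices in messages.values():
            next_frontier |= vertices
        for v in next_frontier:
            depth[v] = level
        frontier = next_frontier
    return depth
```

In this sketch, the high-degree discoveries form a single shared set because many ranks are likely to reach the same hub vertices, so merging them collectively avoids sending duplicates; low-degree discoveries are bucketed by owning rank instead.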
We present techniques to process large scale-free graphs in distributed memory. Our aim is ...
Recent GPU algorithms for constructing spatial hierarchies have achieved promising performa...
This chapter studies the problem of traversing large graphs using the breadth-first search order on ...
There is growing interest in studying large scale graphs having millions of vertices and billions of...
Fast, scalable, low-cost, and low-power execution of parallel graph algorithms is important...
Many emerging large-scale data science applications require searching large graphs distributed acro...
Breadth-First Search is an important kernel used by many graph-processing applications. In many of t...
We present the results obtained by using an evolution of our CUDA-based solution for the exploration...
Data-intensive, graph-based computations are pervasive in several scientific applications, and are k...
Optimized GPU kernels are sufficiently complicated to write that they often are specialized...
The construction of efficient parallel graph algorithms is important for quickly solving p...
Graph data structures have been used in a wide range of applications including scientific and social...
Many emerging large-scale data science applications require searching large graphs distributed acros...
Breadth-first search (BFS) is an essential graph traversal strategy widely used in many computing ...
Breadth-first search (BFS) is a widely used graph algorithm. It is data-intensive, and the data acce...