Abstract—Optimized GPU kernels are sufficiently complicated to write that they often are specialized to input data, target architectures, or applications. This paper presents a multi-search abstraction for computing multiple breadth-first searches in par-allel and demonstrates a high-performance, general implementa-tion. Our abstraction removes the burden of orchestrating graph traversal from the user while providing high performance and low energy usage, an often overlooked component of algorithm design. Energy consumption has become a first-class hardware design constraint for both massive and embedded computing platforms. Our abstraction can be applied to such problems as the all-pairs shortest-path problem, community detection, reachabi...
Parallel graph algorithms have become one of the principal applications of high-performance computin...
An implementation of a newly developed parallel graph traversal algorithm on a new one-chip many-cor...
We have developed a multithreaded implementation of breadth-first search (BFS) of a sparse graph usi...
Abstract—The construction of efficient parallel graph al-gorithms is important for quickly solving p...
Breadth-first search (BFS) is an essential graph traversal strategy widely used in many computing ...
pre-printFast, scalable, low-cost, and low-power execution of parallel graph algorithms is important...
There is growing interest in studying large scale graphs having millions of vertices and billions of...
Data-intensive, graph-based computations are pervasive in several scientific applications, and are k...
Breadth-first search (BFS) is a widely used graph algorithm. It is data-intensive, and the data acce...
When working on graphs, reachability is among the most common problems to address, since it is the b...
We have developed a multithreaded implementation of breadth-first search (BFS) of a sparse graph usi...
Breadth-First Search is a graph traversal technique used in many applications as a building block, e...
On a GPU cluster, the ratio of high computing power to communication bandwidth makes scaling breadth...
Parallel Breadth-First Heuristic Search on a Shared-Memory Architecture We consider a breadth-first ...
With the increasing processing power of multicore computers, parallel graph search (or graph travers...
Parallel graph algorithms have become one of the principal applications of high-performance computin...
An implementation of a newly developed parallel graph traversal algorithm on a new one-chip many-cor...
We have developed a multithreaded implementation of breadth-first search (BFS) of a sparse graph usi...
Abstract—The construction of efficient parallel graph al-gorithms is important for quickly solving p...
Breadth-first search (BFS) is an essential graph traversal strategy widely used in many computing ...
pre-printFast, scalable, low-cost, and low-power execution of parallel graph algorithms is important...
There is growing interest in studying large scale graphs having millions of vertices and billions of...
Data-intensive, graph-based computations are pervasive in several scientific applications, and are k...
Breadth-first search (BFS) is a widely used graph algorithm. It is data-intensive, and the data acce...
When working on graphs, reachability is among the most common problems to address, since it is the b...
We have developed a multithreaded implementation of breadth-first search (BFS) of a sparse graph usi...
Breadth-First Search is a graph traversal technique used in many applications as a building block, e...
On a GPU cluster, the ratio of high computing power to communication bandwidth makes scaling breadth...
Parallel Breadth-First Heuristic Search on a Shared-Memory Architecture We consider a breadth-first ...
With the increasing processing power of multicore computers, parallel graph search (or graph travers...
Parallel graph algorithms have become one of the principal applications of high-performance computin...
An implementation of a newly developed parallel graph traversal algorithm on a new one-chip many-cor...
We have developed a multithreaded implementation of breadth-first search (BFS) of a sparse graph usi...