High-performance implementations of graph algorithms are challenging to implement on new parallel hardware such as GPUs because of three challenges: (1) the difficulty of coming up with graph building blocks, (2) load imbalance on parallel hardware, and (3) graph problems having low arithmetic intensity. To address some of these challenges, GraphBLAS is an innovative, on-going effort by the graph analytics community to propose building blocks based on sparse linear algebra, which allow graph algorithms to be expressed in a performant, succinct, composable, and portable manner. In this paper, we examine the performance challenges of a linear-algebra-based approach to building graph frameworks and describe new design principles for overcoming...
Optimizing linear algebra operations has been a research topic for decades. The compact languag...
This dissertation advances the state of the art for scalable high-performance graph analytics and da...
For large-scale graph analytics on the GPU, the irregularity of dataaccess/control flow and the comp...
High-performance implementations of graph algorithms are challenging to implement on new parallel ha...
We design and implement parallel graph coloring algorithms on the GPU using two different abstractio...
For large-scale graph analytics on the GPU, the irregularity of data access and control flow, and th...
Graphs are de facto data structures for many applications, and efficient graph processing is a must ...
The growing use of graph in many fields has sparked a broad interest in developing high-level graph ...
Abstract—Graphs are common data structures for many applications, and efficient graph processing is ...
The stagnant performance of single core processors, increasing size of data sets, and variety of str...
We factor Beamer’s push-pull, also known as direction-optimized breadth-first-search (DOBFS) into 3 ...
In this thesis we investigate the relation between the structure of input graphs and the performance...
We present a single-node, multi-GPU programmable graph processing library that allows programmers to...
Multi-core and GPU-based systems offer unprecedented computational power. They are, however, challen...
Despite the fact that GPU was originally intended to be as a co-processor specializing in graphics r...
Optimizing linear algebra operations has been a research topic for decades. The compact languag...
This dissertation advances the state of the art for scalable high-performance graph analytics and da...
For large-scale graph analytics on the GPU, the irregularity of dataaccess/control flow and the comp...
High-performance implementations of graph algorithms are challenging to implement on new parallel ha...
We design and implement parallel graph coloring algorithms on the GPU using two different abstractio...
For large-scale graph analytics on the GPU, the irregularity of data access and control flow, and th...
Graphs are de facto data structures for many applications, and efficient graph processing is a must ...
The growing use of graph in many fields has sparked a broad interest in developing high-level graph ...
Abstract—Graphs are common data structures for many applications, and efficient graph processing is ...
The stagnant performance of single core processors, increasing size of data sets, and variety of str...
We factor Beamer’s push-pull, also known as direction-optimized breadth-first-search (DOBFS) into 3 ...
In this thesis we investigate the relation between the structure of input graphs and the performance...
We present a single-node, multi-GPU programmable graph processing library that allows programmers to...
Multi-core and GPU-based systems offer unprecedented computational power. They are, however, challen...
Despite the fact that GPU was originally intended to be as a co-processor specializing in graphics r...
Optimizing linear algebra operations has been a research topic for decades. The compact languag...
This dissertation advances the state of the art for scalable high-performance graph analytics and da...
For large-scale graph analytics on the GPU, the irregularity of dataaccess/control flow and the comp...