Parallel graph-oriented applications expressed in the Bulk-Synchronous Parallel (BSP) and Token Dataflow compute models generate highly-structured communication workloads from messages propagating along graph edges. We can statially expose this structure to traffic compilers and optimization tools to reshape and reduce traffic for higher performance (or lower area, lower energy, lower cost). Such offline traffic optimization eliminates the need for complex, runtime NoC hardware and enables lightweight, scalable NoCs. We perform load balancing, placement, fanout routing, and fine-grained synchronization to optimize our workloads for large networks up to 2025 parallel elements for BSP model and 25 parallel elements for Token Dataflow. This al...
Sequential graph algorithms are implemented through ordered execution of tasks to achieve high work ...
From social networks to language modeling, the growing scale and importance of graph data has driven...
A graph is a ubiquitous data structure that models entities and their interactions through the colle...
Parallel graph-oriented applications expressed in the Bulk-Synchronous Parallel (BSP) and Token Data...
How do we develop programs that are easy to express, easy to reason about, and able to achieve high ...
Future High Performance Computing (HPC) nodes will have many more processors than the contemporary a...
Efficiently processing large graphs is challenging, since parallel graph algorithms suffer from poor...
The scale-out approach of modern data-parallel frameworks such as Apache Flink or Apache Spark has e...
Distributed, shared-nothing architectures of commodity machines are a popular design choice for the ...
The amount of data generated every day is growing exponentially in the big data era. A significant p...
Abstract—In this paper we examine a popular network com-putational model (BSP: Bulk Synchronous Para...
There has been significant recent interest in parallel graph processing due to the need to quickly a...
The importance of high-performance graph processing to solve big data problems targeting high-impact...
In this thesis, we propose optimization techniques for distributed graph processing. First, we descr...
Abstract—We present techniques to process large scale-free graphs in distributed memory. Our aim is ...
Sequential graph algorithms are implemented through ordered execution of tasks to achieve high work ...
From social networks to language modeling, the growing scale and importance of graph data has driven...
A graph is a ubiquitous data structure that models entities and their interactions through the colle...
Parallel graph-oriented applications expressed in the Bulk-Synchronous Parallel (BSP) and Token Data...
How do we develop programs that are easy to express, easy to reason about, and able to achieve high ...
Future High Performance Computing (HPC) nodes will have many more processors than the contemporary a...
Efficiently processing large graphs is challenging, since parallel graph algorithms suffer from poor...
The scale-out approach of modern data-parallel frameworks such as Apache Flink or Apache Spark has e...
Distributed, shared-nothing architectures of commodity machines are a popular design choice for the ...
The amount of data generated every day is growing exponentially in the big data era. A significant p...
Abstract—In this paper we examine a popular network com-putational model (BSP: Bulk Synchronous Para...
There has been significant recent interest in parallel graph processing due to the need to quickly a...
The importance of high-performance graph processing to solve big data problems targeting high-impact...
In this thesis, we propose optimization techniques for distributed graph processing. First, we descr...
Abstract—We present techniques to process large scale-free graphs in distributed memory. Our aim is ...
Sequential graph algorithms are implemented through ordered execution of tasks to achieve high work ...
From social networks to language modeling, the growing scale and importance of graph data has driven...
A graph is a ubiquitous data structure that models entities and their interactions through the colle...