In recent years, GPGPUs have experienced tremendous growth as general-purpose and high-throughput computing devices. Applications from various domains achieve significant speedups using GPGPUs. However, irregular applications do not perform well due to the mismatches between irregular problem structures and SIMD-like GPU architectures. The lack of in-depth characterization and quantifying the ways in which irregular applications differ from regular ones on GPGPUs has prevented users from effectively making use of the hardware resource. To characterize the performance aspects and analyze the bottlenecks, a suite of representative irregular applications are examined on a cycle-accurate GPU simulator as well as a real GPU. The experimental ...
Graph processing is an established and prominent domain that is the foundation of new emerging appli...
Data analysis is a rising field of interest for computer science research due to the growing amount ...
We present an efficient model to analyze and improve the performance of general-purpose computation ...
In recent years, GPGPUs have experienced tremendous growth as general-purpose and high-throughput co...
The computational speed on microprocessors is increasing faster than the communication speed, especi...
Future high-performance computing systems will be hybrid; they will include processors optimized for...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
Abstract—Although GPGPUs are traditionally used to accel-erate workloads with regular control and me...
Graphs are de facto data structures for many applications, and efficient graph processing is a must ...
Implementing a graph traversal (GT) algorithm for GPUs is a very challenging task. It is a core prim...
GPGPU architectures have become the dominant platform for massively parallel workloads, delivering h...
Graphic processors are becoming faster and faster. Computational power within graphic processing uni...
Graphical processing units (GPUs) have recently attracted attention for scientific applications such...
Graph analytics is fundamental in unlocking key insights by mining large volumes of highly connected...
Efficiently implementing a load balancing technique in graph traversal applications for GPUs is a cr...
Graph processing is an established and prominent domain that is the foundation of new emerging appli...
Data analysis is a rising field of interest for computer science research due to the growing amount ...
We present an efficient model to analyze and improve the performance of general-purpose computation ...
In recent years, GPGPUs have experienced tremendous growth as general-purpose and high-throughput co...
The computational speed on microprocessors is increasing faster than the communication speed, especi...
Future high-performance computing systems will be hybrid; they will include processors optimized for...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
Abstract—Although GPGPUs are traditionally used to accel-erate workloads with regular control and me...
Graphs are de facto data structures for many applications, and efficient graph processing is a must ...
Implementing a graph traversal (GT) algorithm for GPUs is a very challenging task. It is a core prim...
GPGPU architectures have become the dominant platform for massively parallel workloads, delivering h...
Graphic processors are becoming faster and faster. Computational power within graphic processing uni...
Graphical processing units (GPUs) have recently attracted attention for scientific applications such...
Graph analytics is fundamental in unlocking key insights by mining large volumes of highly connected...
Efficiently implementing a load balancing technique in graph traversal applications for GPUs is a cr...
Graph processing is an established and prominent domain that is the foundation of new emerging appli...
Data analysis is a rising field of interest for computer science research due to the growing amount ...
We present an efficient model to analyze and improve the performance of general-purpose computation ...