Specialized accelerators are increasingly attractive solutions to continue expected generational performance scaling with slowing technology scaling. Existing programmable accelerators like GPUs are limited to regular algorithms; however, supporting irregularity becomes necessary to accelerate more advanced algorithms or those from challenging domains.Irregularity occurs when aspects of the execution depend on the data -- these aspects can be control, memory, parallelism, and reuse. An example of data-dependent execution is joining two sparse lists where the branch outcome depends on data; this inherently couples computation with memory and precludes efficient vectorization -- defeating the traditional mechanisms of programmable accelerator...
Accelerators, such as GPUs and Intel Xeon Phis, have become the workhorses of high-performance compu...
The world needs special-purpose accelerators to meet future constraints on computation and power con...
The last two decade has witnessed two opposing hardware trends where the DRAM capacity and the acces...
Specialized accelerators are increasingly attractive solutions to continue expected generational per...
The consistent growth of DRAM memory bandwidth and capacity has enabled the computation of increasin...
Programmable accelerators such as GPUs, FPGAs, and DSPs enable modern systems to provide higher perf...
Irregular applications have frequent data-dependent memory accesses and control flow. They arise in ...
<p>Heterogeneous processors with accelerators provide an opportunity to improve performance within a...
Abstract—Although GPGPUs are traditionally used to accel-erate workloads with regular control and me...
Parallel computing hardware is ubiquitous, ranging from cell-phones with multiple cores to super-com...
Real-time, low-energy constraints as well as large amounts of data continue to challenge high perfor...
GPGPU architectures have become the dominant platform for massively parallel workloads, delivering h...
Future high-performance computing systems will be hybrid; they will include processors optimized for...
As we witness the breakdown of Dennard scaling, we can no longer get faster computers by shrinking t...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
Accelerators, such as GPUs and Intel Xeon Phis, have become the workhorses of high-performance compu...
The world needs special-purpose accelerators to meet future constraints on computation and power con...
The last two decade has witnessed two opposing hardware trends where the DRAM capacity and the acces...
Specialized accelerators are increasingly attractive solutions to continue expected generational per...
The consistent growth of DRAM memory bandwidth and capacity has enabled the computation of increasin...
Programmable accelerators such as GPUs, FPGAs, and DSPs enable modern systems to provide higher perf...
Irregular applications have frequent data-dependent memory accesses and control flow. They arise in ...
<p>Heterogeneous processors with accelerators provide an opportunity to improve performance within a...
Abstract—Although GPGPUs are traditionally used to accel-erate workloads with regular control and me...
Parallel computing hardware is ubiquitous, ranging from cell-phones with multiple cores to super-com...
Real-time, low-energy constraints as well as large amounts of data continue to challenge high perfor...
GPGPU architectures have become the dominant platform for massively parallel workloads, delivering h...
Future high-performance computing systems will be hybrid; they will include processors optimized for...
As we witness the breakdown of Dennard scaling, we can no longer get faster computers by shrinking t...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
Accelerators, such as GPUs and Intel Xeon Phis, have become the workhorses of high-performance compu...
The world needs special-purpose accelerators to meet future constraints on computation and power con...
The last two decade has witnessed two opposing hardware trends where the DRAM capacity and the acces...