Abstract—Heterogeneous computing using FPGA accelerators is a promising approach to boosting the performance of application programs under a given power consumption constraint. This paper focuses on optimizations targeting an FPGA-based reconfigurable dataflow computing platform and shows how they benefit an application. To evaluate them, we use the Himeno benchmark, a floating-point computation kernel known to be bound by memory bandwidth. To understand the performance characteristics of the benchmark, we compare our implementation with the current state-of-the-art implementation on GPUs. From the results, we find that our implementation with specialized dataflow pipelines outperforms the current state-of-the-art GPU implementations by making full use of...
The end of Dennard scaling and the imminent end of Moore's law are causing disruptive changes to the ...
FPGA streaming systems are well suited for high-performance computing (HPC) applications, where the ...
FPGA-based token dataflow processing has been shown to accelerate hard-to-parallelize problems exhib...
This paper proposes a new high-level approach for optimising field programmable gate array (FPGA) de...
As we observe diminishing returns for multi-core CPUs, especially when considering power budgets, FP...
Power efficiency has become an important factor in High Performance Computing (HPC). FPGA-based dataflow...
This paper proposes a new high-level approach for optimising field programmable gate array...
Hardware accelerators are nowadays very common in HPC systems, and GPUs are playing a major role in ...
Heterogeneous computing offers a promising solution for high performance and energy efficient comput...
This book is concerned with the emerging field of High Performance Reconfigurable Computing (HPRC), ...
Highly-tuned FPGA implementations can achieve significant performance and power efficiency gains ove...
Power flow computation is ubiquitous in the operation and planning of power systems. Traditional ...
Domain-specific acceleration is now a "must" for all the computing spectrum, g...
Copyright © 2015 Julio Dondo Gazzano et al. This is an open access article distributed under the Cre...
The saturation of single-thread performance, along with the advent of the power wall, has resulted i...