The performance and the versatility of today's PCs exceeds many times the power of the fastest number crunchers in the 90s. Yet the computational hunger of many scientific applications has led to the development of GPU- and FPGA-accelerator cards. In this paper the programming environment and the performance analysis of a super desktop with a combined GPU/FPGA architecture is presented. A unified roofline model is used to compare the performance of the GPU and the FPGA taking into account the computational intensity of the algorithm and the resource consumption. The model is validated by two image processing kernels which are compiled using OpenCL for the GPU and a C-to-VHDL compiler for the FPGA. It is shown that an FPGA compiler outperfor...
Today's computer systems often contains several different processing units aside from the CPU. Among...
Heterogeneous computing offers a promising solution for high performance and energy efficient comput...
There is a growing trend to use coprocessors to offload and accelerate domain-specific applications ...
The performance and the versatility of today's PCs exceeds many times the power of the fastest numbe...
This contribution presents the performance modeling of a super desktop with GPU and FPGA accelerator...
Low-power, high-performance computing nowadays relies on accelerator cards to speed up the calculati...
Nowadays, processors alone cannot deliver what computation hungry image processing applications dema...
Abstract:- The performance of modeling and simulation tools is inherently tied to the platform on wh...
Computation of intensive interactive software applications on R&D desktops require a versatile hardw...
OpenCL has been proposed as a means of accelerating functional computation using FPGA and GPU accele...
Floating-point computing with more than one TFLOP of peak performance is already a reality in recent...
As CPU clock frequencies plateau and the doubling of CPU cores per processor ex-acerbate the memory ...
CPU’s performance is not enough to fit today’s needs, such as cloud computing, biomedical research, ...
The application of accelerators in HPC applications has seen enormous growth in the last decade. In ...
High Performance Computing (HPC) platforms allow scientists to model computationally intensive algor...
Today's computer systems often contains several different processing units aside from the CPU. Among...
Heterogeneous computing offers a promising solution for high performance and energy efficient comput...
There is a growing trend to use coprocessors to offload and accelerate domain-specific applications ...
The performance and the versatility of today's PCs exceeds many times the power of the fastest numbe...
This contribution presents the performance modeling of a super desktop with GPU and FPGA accelerator...
Low-power, high-performance computing nowadays relies on accelerator cards to speed up the calculati...
Nowadays, processors alone cannot deliver what computation hungry image processing applications dema...
Abstract:- The performance of modeling and simulation tools is inherently tied to the platform on wh...
Computation of intensive interactive software applications on R&D desktops require a versatile hardw...
OpenCL has been proposed as a means of accelerating functional computation using FPGA and GPU accele...
Floating-point computing with more than one TFLOP of peak performance is already a reality in recent...
As CPU clock frequencies plateau and the doubling of CPU cores per processor ex-acerbate the memory ...
CPU’s performance is not enough to fit today’s needs, such as cloud computing, biomedical research, ...
The application of accelerators in HPC applications has seen enormous growth in the last decade. In ...
High Performance Computing (HPC) platforms allow scientists to model computationally intensive algor...
Today's computer systems often contains several different processing units aside from the CPU. Among...
Heterogeneous computing offers a promising solution for high performance and energy efficient comput...
There is a growing trend to use coprocessors to offload and accelerate domain-specific applications ...