In this paper, we report on a preliminary investigation of the potential performance gain of programs implemented in field-programmable gate arrays (FPGAs) using a high-level language Chisel compared to ordinary high-level software implementations executed on general-purpose computers and small and cheap computers. FPGAs inherently support parallel evaluations, while sequential computers do not. For this preliminary investigation, we have chosen a highly parallelizable program as a case study to show an upper bound of performance gain. The purpose is to demonstrate whether or not programming FPGAs has the potential for performance optimizations of ordinary programs. We have developed and evaluated Conway's Game of Life for an FPGA, a small ...
Floating-point computing with more than one TFLOP of peak performance is already a reality in recent...
The field of FPGA design is ever-growing due to costs being lower than that of ASICs, as well as the...
Automated code generation and performance tuning tech-niques for concurrent architectures such as GP...
Field-programmable gate arrays represent an army of logical units which can be organized in a highly...
In recent years, the world of high performance computing has been developing rapidly. The goal of t...
After more than 30 years, reconfigurable computing has grown from a concept to a mature field of scien...
As CPU clock frequencies plateau and the doubling of CPU cores per processor ex-acerbate the memory ...
General purpose computing architectures are being called on to work on amore diverse application mix...
Abstract: In order to optimize applications in the Cellular Automata model we have searched for a pe...
The speedup over a microprocessor that can be achieved by implementing some programs on an FPGA has ...
Many computationally intensive scientific applications involve repetitive floating point operations ...
It has been shown that FPGAs could outperform high-end microprocessors on floating-point compu-tatio...
High-Level Languages (HLLs) for FPGAs (Field-Programmable Gate Arrays) facilitate the use of reconfi...
The symposium ParaFPGA focuses on parallel techniques using FPGAs as accelerator in high performance...
Field Programmable Gate Array (FPGA) provides the ability to use, and re-use, hardware with minimal ...
Floating-point computing with more than one TFLOP of peak performance is already a reality in recent...
The field of FPGA design is ever-growing due to costs being lower than that of ASICs, as well as the...
Automated code generation and performance tuning tech-niques for concurrent architectures such as GP...
Field-programmable gate arrays represent an army of logical units which can be organized in a highly...
In recent years, the world of high performance computing has been developing rapidly. The goal of t...
After more than 30 years, reconfigurable computing has grown from a concept to a mature field of scien...
As CPU clock frequencies plateau and the doubling of CPU cores per processor ex-acerbate the memory ...
General purpose computing architectures are being called on to work on amore diverse application mix...
Abstract: In order to optimize applications in the Cellular Automata model we have searched for a pe...
The speedup over a microprocessor that can be achieved by implementing some programs on an FPGA has ...
Many computationally intensive scientific applications involve repetitive floating point operations ...
It has been shown that FPGAs could outperform high-end microprocessors on floating-point compu-tatio...
High-Level Languages (HLLs) for FPGAs (Field-Programmable Gate Arrays) facilitate the use of reconfi...
The symposium ParaFPGA focuses on parallel techniques using FPGAs as accelerator in high performance...
Field Programmable Gate Array (FPGA) provides the ability to use, and re-use, hardware with minimal ...
Floating-point computing with more than one TFLOP of peak performance is already a reality in recent...
The field of FPGA design is ever-growing due to costs being lower than that of ASICs, as well as the...
Automated code generation and performance tuning tech-niques for concurrent architectures such as GP...