PPoPP'12 extended versionInternational audienceSome data- and compute-intensive applications can be accelerated by o oading portions of codes to platforms such as GPGPUs or FPGAs. However, to get high performance for these kernels, it is mandatory to restructure the application, to generate adequate communication mechanisms for the transfer of remote data, and to make good usage of the memory bandwidth. In the context of the high-level synthesis (HLS), from a C program, of hardware accelerators on FPGA, we show how to automatically generate optimized remote accesses for an accelerator communicating to an exter- nal DDR memory. Loop tiling is used to enable block com- munications, suitable for DDR memories. Pipelined communication processes ...
Abstract—Current tools for High-Level Synthesis (HLS) excel at exploiting Instruction-Level Parallel...
High-level synthesis (HLS) tools simplify the FPGA design processes by allowing users to express the...
High-level synthesis (HLS) tools provide automatic generation of hardware at the register transfer l...
PPoPP'12 extended versionInternational audienceSome data- and compute-intensive applications can be ...
Some data- and compute-intensive applications can be ac-celerated by offloading portions of codes to...
As the scaling down of transistor size no longer provides boost to processor clock frequency, there ...
Abstract—Thanks to efficient scheduling, resource sharing, and finite-state machines generation, hig...
International audienceThanks to efficient scheduling, resource sharing, and finite-state machines ge...
CPU-FPGA heterogeneous architectures are attracting ever-increasing attention in an attempt to advan...
Field programmable gate arrays or FPGAs are the Swiss army knife of the compute accelerators. They a...
This dissertation focuses on efficient generation of custom processors from high-level language desc...
The embedded DSP blocks in modern Field Programmable Gate Arrays (FPGAs) are highly capable and supp...
Specialized accelerators can exploit spatial parallelism on both operations and data thanks to a ded...
This paper presents a technique for automatic synthesis of high-performance FPGA-based computing mac...
The adoption of High-Level Synthesis (HLS) tools has significantly reduced accelerator design time. ...
Abstract—Current tools for High-Level Synthesis (HLS) excel at exploiting Instruction-Level Parallel...
High-level synthesis (HLS) tools simplify the FPGA design processes by allowing users to express the...
High-level synthesis (HLS) tools provide automatic generation of hardware at the register transfer l...
PPoPP'12 extended versionInternational audienceSome data- and compute-intensive applications can be ...
Some data- and compute-intensive applications can be ac-celerated by offloading portions of codes to...
As the scaling down of transistor size no longer provides boost to processor clock frequency, there ...
Abstract—Thanks to efficient scheduling, resource sharing, and finite-state machines generation, hig...
International audienceThanks to efficient scheduling, resource sharing, and finite-state machines ge...
CPU-FPGA heterogeneous architectures are attracting ever-increasing attention in an attempt to advan...
Field programmable gate arrays or FPGAs are the Swiss army knife of the compute accelerators. They a...
This dissertation focuses on efficient generation of custom processors from high-level language desc...
The embedded DSP blocks in modern Field Programmable Gate Arrays (FPGAs) are highly capable and supp...
Specialized accelerators can exploit spatial parallelism on both operations and data thanks to a ded...
This paper presents a technique for automatic synthesis of high-performance FPGA-based computing mac...
The adoption of High-Level Synthesis (HLS) tools has significantly reduced accelerator design time. ...
Abstract—Current tools for High-Level Synthesis (HLS) excel at exploiting Instruction-Level Parallel...
High-level synthesis (HLS) tools simplify the FPGA design processes by allowing users to express the...
High-level synthesis (HLS) tools provide automatic generation of hardware at the register transfer l...