It is very challenging to design an on-chip memory architecture for high-performance kernels with large amount of computation and data. The on-chip memory architecture must support efficient data access from both the computation part and the external memory part, which often have very different expectations about how data should be accessed and stored. Previous work provides only a limited set of optimizations. In this paper we show how to fundamentally restructure on-chip buffers, by decoupling logical array view from the physical buffer view, and providing general mapping schemes for the two. Our framework considers the entire data flow from the external memory to the computation part in order to minimize resource usage without creating p...
PPoPP'12 extended versionInternational audienceSome data- and compute-intensive applications can be ...
Memory is the biggest limiting factor to the widespread use of FPGAs for high-level image processing...
UnrestrictedConfigurable architectures offer the unique opportunity of realizing hardware designs ta...
FPGA designs have an immense design space, and there can be an order of magnitude performance differ...
Abstract—Developing FPGA implementations with an input specification in a high-level programming lan...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The high potential performance of FPGAs cannot be exploited if a design suffers a memory bottleneck....
The signicant development of high-level synthesis tools has greatly facilitated FPGAs as general com...
Memory is the biggest limiting factor to the widespread use of FPGAs for high-level image processing...
Some data- and compute-intensive applications can be ac-celerated by offloading portions of codes to...
Designs implemented on field-programmable gate arrays (FPGAs) via high-level synthesis (HLS) suffer...
Recent works demonstrate several benefits of synthesizing software binaries onto FPGA hardware, incl...
Behavioral synthesis tools have made significant progress in compiling high-level programs into regi...
With the large resource densities available on modern FPGAs it is often the available memory bandwid...
International audiencemost of advanced driver assistance systems are developed for safety and better...
PPoPP'12 extended versionInternational audienceSome data- and compute-intensive applications can be ...
Memory is the biggest limiting factor to the widespread use of FPGAs for high-level image processing...
UnrestrictedConfigurable architectures offer the unique opportunity of realizing hardware designs ta...
FPGA designs have an immense design space, and there can be an order of magnitude performance differ...
Abstract—Developing FPGA implementations with an input specification in a high-level programming lan...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The high potential performance of FPGAs cannot be exploited if a design suffers a memory bottleneck....
The signicant development of high-level synthesis tools has greatly facilitated FPGAs as general com...
Memory is the biggest limiting factor to the widespread use of FPGAs for high-level image processing...
Some data- and compute-intensive applications can be ac-celerated by offloading portions of codes to...
Designs implemented on field-programmable gate arrays (FPGAs) via high-level synthesis (HLS) suffer...
Recent works demonstrate several benefits of synthesizing software binaries onto FPGA hardware, incl...
Behavioral synthesis tools have made significant progress in compiling high-level programs into regi...
With the large resource densities available on modern FPGAs it is often the available memory bandwid...
International audiencemost of advanced driver assistance systems are developed for safety and better...
PPoPP'12 extended versionInternational audienceSome data- and compute-intensive applications can be ...
Memory is the biggest limiting factor to the widespread use of FPGAs for high-level image processing...
UnrestrictedConfigurable architectures offer the unique opportunity of realizing hardware designs ta...