FPGA designs have an immense design space, and there can be an order of magnitude performance difference between a naive and an optimized implementation. To optimize memory, the programmer faces the burden of a large number of iterative manual optimizations to achieve high performance. Motivated by this challenge, the goal of this work is to make memory optimizations easier to achieve. We present two frameworks 1) a buffer-to-BRAM mapping tool called MPack, and 2) a library of Stream Memory Components (SMCs). MPack optimizes the mapping of application buffers to FPGAâ s on-chip physical memories (BRAMs). One of the key challenges is the mismatch between the dimensions (bit width and depth) of application buffers and FPGAâ s on-chip physic...
Parallel computing platforms provide good performance for streaming applications within a limited po...
The Legup High-Level Synthesis (HLS) tool permits the synthesis of multi-threaded software into para...
With the large resource densities available on modern FPGAs it is often the available memory bandwid...
FPGA designs have an immense design space, and there can be an order of magnitude performance differ...
It is very challenging to design an on-chip memory architecture for high-performance kernels with la...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Abstract—Developing FPGA implementations with an input specification in a high-level programming lan...
Stream languages explicitly describe fork-join parallelism and pipelines, offering a powerful progra...
As we witness the breakdown of Dennard scaling, we can no longer get faster computers by shrinking t...
Behavioral synthesis tools have made significant progress in compiling high-level programs into regi...
The trade-off between throughput and memory constraints is a common design problem in embedded syste...
The trade-off between throughput and memory constraints is a common design problem in embedded syste...
As processor speeds continue to increase, the memory bottleneck remains a primary impediment to atta...
This paper describes a compiler for stream programs that efficiently schedules computational kernels...
Moore's Law has helped Field Programmable Gate Arrays (FPGAs) scale continuously in speed, capacity ...
Parallel computing platforms provide good performance for streaming applications within a limited po...
The Legup High-Level Synthesis (HLS) tool permits the synthesis of multi-threaded software into para...
With the large resource densities available on modern FPGAs it is often the available memory bandwid...
FPGA designs have an immense design space, and there can be an order of magnitude performance differ...
It is very challenging to design an on-chip memory architecture for high-performance kernels with la...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Abstract—Developing FPGA implementations with an input specification in a high-level programming lan...
Stream languages explicitly describe fork-join parallelism and pipelines, offering a powerful progra...
As we witness the breakdown of Dennard scaling, we can no longer get faster computers by shrinking t...
Behavioral synthesis tools have made significant progress in compiling high-level programs into regi...
The trade-off between throughput and memory constraints is a common design problem in embedded syste...
The trade-off between throughput and memory constraints is a common design problem in embedded syste...
As processor speeds continue to increase, the memory bottleneck remains a primary impediment to atta...
This paper describes a compiler for stream programs that efficiently schedules computational kernels...
Moore's Law has helped Field Programmable Gate Arrays (FPGAs) scale continuously in speed, capacity ...
Parallel computing platforms provide good performance for streaming applications within a limited po...
The Legup High-Level Synthesis (HLS) tool permits the synthesis of multi-threaded software into para...
With the large resource densities available on modern FPGAs it is often the available memory bandwid...