Abstract—Developing FPGA implementations with an input specification in a high-level programming language such as C/C++ or OpenCL allows for a substantially shortened design cycle compared to a design entry at register transfer level. This work targets high-level synthesis (HLS) implementations that process large amounts of data and therefore require access to an off-chip memory. We leverage the customizability of the FPGA on-chip memory to automatically construct a multi-cache architecture in order to enhance the performance of the interface between parallel functional units of the HLS core and an external memory. Our focus is on automatic cache sizing. Firstly, our technique determines and uses up unused left-over block RAM resources for ...
It is very challenging to design an on-chip memory architecture for high-performance kernels with la...
The performance of a computing system heavily depends on the memory hierarchy. Fast but expensive ca...
The demand for scalable, high-performance computing has increased as the size of datasets has grown ...
Memory-intensive implementations often require access to an external, off-chip memory which can subs...
Designs implemented on field-programmable gate arrays (FPGAs) via high-level synthesis (HLS) suffer...
Using FPGA-based acceleration of high-performance computing (HPC) applications to reduce energy and ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
As memory accesses increasingly limit the overall performance of reconfigurable accelerators, it is ...
Caches in FPGAs can improve the performance of soft processors and other applications beset by slow ...
Memory-intensive implementations often require access to an external, off-chip memory which can subs...
The Legup High-Level Synthesis (HLS) tool permits the synthesis of multi-threaded software into para...
ABSTRACT Throughput processing involves using many different contexts or threads to solve multiple p...
In the last decade, FPGAs appeared as a credible alternative for big data and high-performance compu...
FPGA designs have an immense design space, and there can be an order of magnitude performance differ...
Abstract—We describe new multi-ported cache designs suit-able for use in FPGA-based processor/parall...
It is very challenging to design an on-chip memory architecture for high-performance kernels with la...
The performance of a computing system heavily depends on the memory hierarchy. Fast but expensive ca...
The demand for scalable, high-performance computing has increased as the size of datasets has grown ...
Memory-intensive implementations often require access to an external, off-chip memory which can subs...
Designs implemented on field-programmable gate arrays (FPGAs) via high-level synthesis (HLS) suffer...
Using FPGA-based acceleration of high-performance computing (HPC) applications to reduce energy and ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
As memory accesses increasingly limit the overall performance of reconfigurable accelerators, it is ...
Caches in FPGAs can improve the performance of soft processors and other applications beset by slow ...
Memory-intensive implementations often require access to an external, off-chip memory which can subs...
The Legup High-Level Synthesis (HLS) tool permits the synthesis of multi-threaded software into para...
ABSTRACT Throughput processing involves using many different contexts or threads to solve multiple p...
In the last decade, FPGAs appeared as a credible alternative for big data and high-performance compu...
FPGA designs have an immense design space, and there can be an order of magnitude performance differ...
Abstract—We describe new multi-ported cache designs suit-able for use in FPGA-based processor/parall...
It is very challenging to design an on-chip memory architecture for high-performance kernels with la...
The performance of a computing system heavily depends on the memory hierarchy. Fast but expensive ca...
The demand for scalable, high-performance computing has increased as the size of datasets has grown ...