Abstract—A nonlinear optimization framework is proposed in this paper to automate exploration of the design space consisting of data-reuse (buffering) decisions and loop-level parallelization, in the context of field-programmable-gate-array-targeted hard-ware compilation. Buffering frequently accessed data in on-chip memories can reduce off-chip memory accesses and open avenues for parallelization. However, the exploitation of both data reuse and parallelization is limited by the memory resources available on-chip. As a result, considering these two problems separately, e.g., first exploring data reuse and then exploring data-level paral-lelization, based on the data-reuse options determined in the first step, may not yield the performance-...
For decades, the computational performance of processors has grown at a faster rate than the availab...
Thesis (Ph.D.)--University of Washington, 2022Modern field-programmable gate arrays (FPGAs) have rec...
FPGA-based accelerators demonstrated high energy efficiency compared to GPUs and CPUs. However, sing...
This thesis presents a methodology to automatically determine a data memory organisation at compilet...
This paper describes our approaches to raise the level of abstraction at which hardwa-re suitable fo...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Behavioral synthesis tools have made significant progress in compiling high-level programs into regi...
145 pagesWith the pursuit of improving compute performance under strict power constraints, there is ...
Field Programmable Gate Array (FPGA) provides the ability to use, and re-use, hardware with minimal ...
With the large resource densities available on modern FPGAs it is often the available memory bandwi...
UnrestrictedConfigurable architectures offer the unique opportunity of realizing hardware designs ta...
Field-programmable gate arrays represent an army of logical units which can be organized in a highly...
With reconfigurable fabrics delivering increasing performance over the years, Field-Programmable Gat...
This paper presents a new methodology based on evolutionary multi-objective optimization (EMO) to sy...
It is very challenging to design an on-chip memory architecture for high-performance kernels with la...
For decades, the computational performance of processors has grown at a faster rate than the availab...
Thesis (Ph.D.)--University of Washington, 2022Modern field-programmable gate arrays (FPGAs) have rec...
FPGA-based accelerators demonstrated high energy efficiency compared to GPUs and CPUs. However, sing...
This thesis presents a methodology to automatically determine a data memory organisation at compilet...
This paper describes our approaches to raise the level of abstraction at which hardwa-re suitable fo...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Behavioral synthesis tools have made significant progress in compiling high-level programs into regi...
145 pagesWith the pursuit of improving compute performance under strict power constraints, there is ...
Field Programmable Gate Array (FPGA) provides the ability to use, and re-use, hardware with minimal ...
With the large resource densities available on modern FPGAs it is often the available memory bandwi...
UnrestrictedConfigurable architectures offer the unique opportunity of realizing hardware designs ta...
Field-programmable gate arrays represent an army of logical units which can be organized in a highly...
With reconfigurable fabrics delivering increasing performance over the years, Field-Programmable Gat...
This paper presents a new methodology based on evolutionary multi-objective optimization (EMO) to sy...
It is very challenging to design an on-chip memory architecture for high-performance kernels with la...
For decades, the computational performance of processors has grown at a faster rate than the availab...
Thesis (Ph.D.)--University of Washington, 2022Modern field-programmable gate arrays (FPGAs) have rec...
FPGA-based accelerators demonstrated high energy efficiency compared to GPUs and CPUs. However, sing...