With the large resource densities available on modern FPGAs it is often the available memory bandwidth that limits the parallelism (and therefore performance) that can be achieved. For this reason the focus of this thesis is the development of an integrated scheduling and memory optimisation methodology to allow high levels of parallelism to be exploited in FPGA based designs. A manual translation from C to hardware is first investigated as a case study, exposing a number of potential optimisation techniques that have not been exploited in existing work. An existing outer loop pipelining approach, originally developed for VLIW processors, is extended and adapted for application to FPGAs. The outer loop pipelining methodology is first develo...
Timothy J. Callahan and John Wawrzynek University of California--Berkeley Widespread acceptance of F...
This paper proposes an algorithm for mappinglogical to physical memory resources on Field-Programmab...
This paper discusses a pair of synthesis algorithms that optimise a SystemC design to minimise area ...
With the large resource densities available on modern FPGAs it is often the available memory bandwi...
This paper describes our approaches to raise the level of abstraction at which hardwa-re suitable fo...
Behavioral synthesis tools have made significant progress in compiling high-level programs into regi...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Moore's Law has helped Field Programmable Gate Arrays (FPGAs) scale continuously in speed, capacity ...
FPGA designs have an immense design space, and there can be an order of magnitude performance differ...
This paper discusses the balance between loop-level parallelism and clock rate for enhancing the per...
This paper discusses a pair of synthesis algorithms that optimise a SystemC design to minimise area ...
Modern embedded systems for DSP applications are increasingly being implemented on heterogeneous pro...
The Legup High-Level Synthesis (HLS) tool permits the synthesis of multi-threaded software into para...
It is very challenging to design an on-chip memory architecture for high-performance kernels with la...
The optimal control of memory space to raise speed of parallel processing systems is a scienti...
Timothy J. Callahan and John Wawrzynek University of California--Berkeley Widespread acceptance of F...
This paper proposes an algorithm for mappinglogical to physical memory resources on Field-Programmab...
This paper discusses a pair of synthesis algorithms that optimise a SystemC design to minimise area ...
With the large resource densities available on modern FPGAs it is often the available memory bandwi...
This paper describes our approaches to raise the level of abstraction at which hardwa-re suitable fo...
Behavioral synthesis tools have made significant progress in compiling high-level programs into regi...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Moore's Law has helped Field Programmable Gate Arrays (FPGAs) scale continuously in speed, capacity ...
FPGA designs have an immense design space, and there can be an order of magnitude performance differ...
This paper discusses the balance between loop-level parallelism and clock rate for enhancing the per...
This paper discusses a pair of synthesis algorithms that optimise a SystemC design to minimise area ...
Modern embedded systems for DSP applications are increasingly being implemented on heterogeneous pro...
The Legup High-Level Synthesis (HLS) tool permits the synthesis of multi-threaded software into para...
It is very challenging to design an on-chip memory architecture for high-performance kernels with la...
The optimal control of memory space to raise speed of parallel processing systems is a scienti...
Timothy J. Callahan and John Wawrzynek University of California--Berkeley Widespread acceptance of F...
This paper proposes an algorithm for mappinglogical to physical memory resources on Field-Programmab...
This paper discusses a pair of synthesis algorithms that optimise a SystemC design to minimise area ...