This dissertation describes a methodology for the generation of a custom memory interface and associated direct memory access (DMA) controller for FPGA-based kernels that have a regular access pattern. The interface provides explicit support for the following features: (1) memory latency hiding, (2) static access scheduling, and (3) data reuse. The target platform is a multi-FPGA platform, the Convey HC-1, which has an advanced memory system that presents the user logic with three critical design challenges: the memory system itself does not perform caching or prefetching, memory operations are arbitrarily reordered, and the memory performance depends on the access order provided by the user logic. The objective of the interface is to recon...
The performance gap between CPUs, and memory memory has diverged significantly since the 1980's maki...
After more than 30 years, reconfigurable computing has grown from a concept to a mature field of scien...
With the large resource densities available on modern FPGAs it is often the available memory bandwi...
This dissertation describes a methodology for the generation of a custom memory interface and associ...
Field Programmable Gate Arrays (FPGAs) have become highly attractive as accelerators due to their lo...
As designers and researchers strive to achieve higher performance, field-programmable gate arrays (F...
There is currently a strong focus across the technological landscape to create machines capable of p...
Many algorithms and applications in scientific computing exhibit irregular access patterns as consec...
High-Level Synthesis (HLS) tools are a set of algorithms that allow programmers to obtain implementa...
Several system-on-chip (SoC) platforms have recently emerged that use reconfigurable logic (FPGAs) a...
FPGA-SoCs like Xilinx's Zynq-7000 and Altera's Generation 10 SoCs provide an integrated platform for...
Designs implemented on field-programmable gate arrays (FPGAs) via high-level synthesis (HLS) suffer...
GPUs offer high-performance floating-point computation at commodity prices, but their usage is hinde...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The main motivation for dynamic memory management is to increase the memory efficiency of a system b...
The performance gap between CPUs, and memory memory has diverged significantly since the 1980's maki...
After more than 30 years, reconfigurable computing has grown from a concept to a mature field of scien...
With the large resource densities available on modern FPGAs it is often the available memory bandwi...
This dissertation describes a methodology for the generation of a custom memory interface and associ...
Field Programmable Gate Arrays (FPGAs) have become highly attractive as accelerators due to their lo...
As designers and researchers strive to achieve higher performance, field-programmable gate arrays (F...
There is currently a strong focus across the technological landscape to create machines capable of p...
Many algorithms and applications in scientific computing exhibit irregular access patterns as consec...
High-Level Synthesis (HLS) tools are a set of algorithms that allow programmers to obtain implementa...
Several system-on-chip (SoC) platforms have recently emerged that use reconfigurable logic (FPGAs) a...
FPGA-SoCs like Xilinx's Zynq-7000 and Altera's Generation 10 SoCs provide an integrated platform for...
Designs implemented on field-programmable gate arrays (FPGAs) via high-level synthesis (HLS) suffer...
GPUs offer high-performance floating-point computation at commodity prices, but their usage is hinde...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The main motivation for dynamic memory management is to increase the memory efficiency of a system b...
The performance gap between CPUs, and memory memory has diverged significantly since the 1980's maki...
After more than 30 years, reconfigurable computing has grown from a concept to a mature field of scien...
With the large resource densities available on modern FPGAs it is often the available memory bandwi...