Memory-based computing stores pre-computed function results in memory to be read at runtime. FPGAs group together multiple block memories (BRAMs) to form this memory, all accessed as a single monolithic device. We introduce a novel ring-based architecture to leverage parallel accesses to these constituent BRAMs, benefiting low latency applications that rely on: highly-complex functions; numerical precision via iterative computation; or many parallel data-paths accessing a shared memory resource. The implemented function’s performance is independent of its complexity, enabling significant latency reduc- tions for compute-bound operations. We assess common functions (sqrt, power, trigonometric, hyperbolic functions) on the Xilinx Alveo U280 F...
With the arrival of large Field Programmable Gate Arrays (FPGAs) it is possible to build an entire c...
FPGAs rely on massive datapath parallelism to accelerate applications even with a low clock frequenc...
FPGA-based data processing is becoming increasingly relevant in data centers, as the transformation ...
The FPGAs of today are being used to implement large, system-sized circuits. Systems often require ...
Where do all the cycles go when microprocessor applications are implemented spatially as circuits on...
We propose an FPGA chip architecture based on a conventional FPGA logic array core, in which I/O pin...
A novel efficient bus architecture is presented together with an application. The bus architecture b...
Since they were first introduced three decades ago, Field-Programmable Gate Arrays (FPGAs) have evol...
ABSTRACT Throughput processing involves using many different contexts or threads to solve multiple p...
The performance gap between CPUs, and memory memory has diverged significantly since the 1980's maki...
This article presents a latency-optimized implementation of the SIMECK lightweight block cipher on a...
Many algorithms and applications in scientific computing exhibit irregular access patterns as consec...
On-chip multiport memory cores are crucial primitives for many modern high-performance reconfigurabl...
BIKE is a Key Encapsulation Mechanism selected as an alternate candidate in NIST’s PQC standardizati...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
With the arrival of large Field Programmable Gate Arrays (FPGAs) it is possible to build an entire c...
FPGAs rely on massive datapath parallelism to accelerate applications even with a low clock frequenc...
FPGA-based data processing is becoming increasingly relevant in data centers, as the transformation ...
The FPGAs of today are being used to implement large, system-sized circuits. Systems often require ...
Where do all the cycles go when microprocessor applications are implemented spatially as circuits on...
We propose an FPGA chip architecture based on a conventional FPGA logic array core, in which I/O pin...
A novel efficient bus architecture is presented together with an application. The bus architecture b...
Since they were first introduced three decades ago, Field-Programmable Gate Arrays (FPGAs) have evol...
ABSTRACT Throughput processing involves using many different contexts or threads to solve multiple p...
The performance gap between CPUs, and memory memory has diverged significantly since the 1980's maki...
This article presents a latency-optimized implementation of the SIMECK lightweight block cipher on a...
Many algorithms and applications in scientific computing exhibit irregular access patterns as consec...
On-chip multiport memory cores are crucial primitives for many modern high-performance reconfigurabl...
BIKE is a Key Encapsulation Mechanism selected as an alternate candidate in NIST’s PQC standardizati...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
With the arrival of large Field Programmable Gate Arrays (FPGAs) it is possible to build an entire c...
FPGAs rely on massive datapath parallelism to accelerate applications even with a low clock frequenc...
FPGA-based data processing is becoming increasingly relevant in data centers, as the transformation ...