The ever-increasing complexity of high-performance computing applications limits performance due to memory constraints in FPGAs. To address this issue, we propose the Advanced Pattern based Memory Controller (APMC), which supports both regular and irregular memory patterns. The proposed memory controller systematically reduces the latency faced by processors/accelerators due to irregular memory access patterns and low memory bandwidth by using a smart mechanism that collects and stores the different patterns and reuses them when it is needed. In order to prove the effectiveness of the proposed controller, we implemented and tested it on a Xilinx ML505 FPGA board. In order to prove that our controller is efficient in a variety of scenarios, ...
As designers and researchers strive to achieve higher performance, field-programmable gate arrays (F...
Modern embedded compute platforms increasingly contain both microprocessors and field-programmable g...
Accessing the memory efficiently to keep up with the data processing rate is a well known problem in...
The ever-increasing complexity of high-performance computing applications limits performance due to ...
Heterogeneous architectures are increasingly popular due to their flexibility and high performance p...
Many algorithms and applications in scientific computing exhibit irregular access patterns as consec...
In this work, we propose a Programmable Vector Memory Controller (PVMC), which boosts noncontiguous ...
HPC industry demands more computing units on FPGAs, to enhance the performance by using task/data pa...
To manage power and memory wall affects, the HPC industry supports FPGA reconfigurable accelerators ...
The motivation of this research was to evaluate the main memory performance of a hybrid super comput...
Traditionally, system designers have attempted to improve system performance by scheduling the proce...
Over the past few years there has been increased interest in building custom computing machines (CCM...
The Gustavson’s algorithm (i.e., the row-wise product algorithm) shows its potential as the backbone...
Increasingly High-Performance Computing (HPC) applications run on heterogeneous multi-core platforms...
With computing systems becoming ubiquitous, numerous data sets of extremely large size are becoming ...
As designers and researchers strive to achieve higher performance, field-programmable gate arrays (F...
Modern embedded compute platforms increasingly contain both microprocessors and field-programmable g...
Accessing the memory efficiently to keep up with the data processing rate is a well known problem in...
The ever-increasing complexity of high-performance computing applications limits performance due to ...
Heterogeneous architectures are increasingly popular due to their flexibility and high performance p...
Many algorithms and applications in scientific computing exhibit irregular access patterns as consec...
In this work, we propose a Programmable Vector Memory Controller (PVMC), which boosts noncontiguous ...
HPC industry demands more computing units on FPGAs, to enhance the performance by using task/data pa...
To manage power and memory wall affects, the HPC industry supports FPGA reconfigurable accelerators ...
The motivation of this research was to evaluate the main memory performance of a hybrid super comput...
Traditionally, system designers have attempted to improve system performance by scheduling the proce...
Over the past few years there has been increased interest in building custom computing machines (CCM...
The Gustavson’s algorithm (i.e., the row-wise product algorithm) shows its potential as the backbone...
Increasingly High-Performance Computing (HPC) applications run on heterogeneous multi-core platforms...
With computing systems becoming ubiquitous, numerous data sets of extremely large size are becoming ...
As designers and researchers strive to achieve higher performance, field-programmable gate arrays (F...
Modern embedded compute platforms increasingly contain both microprocessors and field-programmable g...
Accessing the memory efficiently to keep up with the data processing rate is a well known problem in...