The signicant development of high-level synthesis tools has greatly facilitated FPGAs as general computing platforms. During the parallelism optimization for the data path, mem-ory becomes a crucial bottleneck that impedes performance enhancement. Simultaneous data access is highly restricted by the data mapping strategy and memory port constraint. Memory partitioning can eciently map data elements in the same logical array onto multiple physical banks so that the accesses to the array are parallelized. Previous meth-ods for memory partitioning mainly focused on cyclic parti-tioning for single-port memory. In this work we propose a generalized memory-partitioning framework to provide high data throughput of on-chip memories. We generalize c...
[[abstract]]In this paper, we present a new integrated synthesis and partitioning method for multipl...
Behavioral synthesis tools have made significant progress in compiling high-level programs into regi...
This paper discusses a pair of synthesis algorithms that optimise a SystemC design to minimise area ...
Memory partitioning is widely adopted to efficiently increase the memory bandwidth by using multiple...
The Legup High-Level Synthesis (HLS) tool permits the synthesis of multi-threaded software into para...
One step in the synthesis for FPGA-based Reconfig-urable Computers (RCs) involves mapping the design...
Modern, high performance reconfigurable architectures integrate on-chip, distributed block RAM modul...
In the last decade, FPGAs appeared as a credible alternative for big data and high-performance compu...
Achieving optimal throughput by extracting parallelism in behavioral synthesis often exaggerates mem...
It is very challenging to design an on-chip memory architecture for high-performance kernels with la...
This paper describes a new procedure for generating very large realistic benchmark circuits which ar...
[[abstract]]In this paper, we present a new integrated synthesis and partitioning method for multipl...
Being \u27memory-centric\u27, the single-chip distributed logic-memory (DLM) architecture can signif...
In this paper, we introduce a new recursive partitioning paradigm PROP which combines partitioning, ...
We demonstrate circuits that generate set and integer partitions on a set S of n objects at a rate o...
[[abstract]]In this paper, we present a new integrated synthesis and partitioning method for multipl...
Behavioral synthesis tools have made significant progress in compiling high-level programs into regi...
This paper discusses a pair of synthesis algorithms that optimise a SystemC design to minimise area ...
Memory partitioning is widely adopted to efficiently increase the memory bandwidth by using multiple...
The Legup High-Level Synthesis (HLS) tool permits the synthesis of multi-threaded software into para...
One step in the synthesis for FPGA-based Reconfig-urable Computers (RCs) involves mapping the design...
Modern, high performance reconfigurable architectures integrate on-chip, distributed block RAM modul...
In the last decade, FPGAs appeared as a credible alternative for big data and high-performance compu...
Achieving optimal throughput by extracting parallelism in behavioral synthesis often exaggerates mem...
It is very challenging to design an on-chip memory architecture for high-performance kernels with la...
This paper describes a new procedure for generating very large realistic benchmark circuits which ar...
[[abstract]]In this paper, we present a new integrated synthesis and partitioning method for multipl...
Being \u27memory-centric\u27, the single-chip distributed logic-memory (DLM) architecture can signif...
In this paper, we introduce a new recursive partitioning paradigm PROP which combines partitioning, ...
We demonstrate circuits that generate set and integer partitions on a set S of n objects at a rate o...
[[abstract]]In this paper, we present a new integrated synthesis and partitioning method for multipl...
Behavioral synthesis tools have made significant progress in compiling high-level programs into regi...
This paper discusses a pair of synthesis algorithms that optimise a SystemC design to minimise area ...