High Level Synthesis tools have reduced accelerator design time. However, a complex scaling problem that remains is the data transfer bottleneck. Accelerators require huge amounts of data and are often limited by interconnect resources. Local buffers can reduce communication by exploiting data reuse, but the data access order has a substantial impact on the amount of reuse that can be utilized. With loop transformations such as interchange and tiling the data access order can be modified. However, for real applications the design space is huge, finding the best set of transformations is often intractable. Therefore, we present a new methodology that minimizes the data transfer by loop interchange and tiling. In contrast to other methods we ...
This paper addresses the problem of compiling perfectly nested loops for multicomputers (distributed...
In the framework of perfect loop nests with uniform dependences, tiling has been extensively studied...
Link to published version: http://ieeexplore.ieee.org/iel2/390/6075/00236705.pdf?tp=&arnumber=236705...
High Level Synthesis tools have reduced accelerator design time. However, a complex scaling problem ...
High Level Synthesis tools have reduced accelerator design time. How-ever, a complex scaling problem...
The adoption of High-Level Synthesis (HLS) tools has significantly reduced accelerator design time. ...
High-level synthesis (HLS) is well capable of generating control and computation circuits for FPGA a...
International audienceLoop tiling is a loop transformation widely used to improve spatial and tempor...
We deal with compiler support for parallelizing perfectly nested loops for coarse-grain distributed ...
Dynamic hardware generation reduces the number of FPGA resources needed and speeds up an application...
Some data- and compute-intensive applications can be ac-celerated by offloading portions of codes to...
Iteration space tiling is a common strategy used by parallelizing compilers to reduce communication ...
This paper addresses the problem of compiling nested loops for distributed memory machines. The rela...
PPoPP'12 extended versionInternational audienceSome data- and compute-intensive applications can be ...
International audienceIn the framework of fully permutable loops, tiling is a compiler technique (al...
This paper addresses the problem of compiling perfectly nested loops for multicomputers (distributed...
In the framework of perfect loop nests with uniform dependences, tiling has been extensively studied...
Link to published version: http://ieeexplore.ieee.org/iel2/390/6075/00236705.pdf?tp=&arnumber=236705...
High Level Synthesis tools have reduced accelerator design time. However, a complex scaling problem ...
High Level Synthesis tools have reduced accelerator design time. How-ever, a complex scaling problem...
The adoption of High-Level Synthesis (HLS) tools has significantly reduced accelerator design time. ...
High-level synthesis (HLS) is well capable of generating control and computation circuits for FPGA a...
International audienceLoop tiling is a loop transformation widely used to improve spatial and tempor...
We deal with compiler support for parallelizing perfectly nested loops for coarse-grain distributed ...
Dynamic hardware generation reduces the number of FPGA resources needed and speeds up an application...
Some data- and compute-intensive applications can be ac-celerated by offloading portions of codes to...
Iteration space tiling is a common strategy used by parallelizing compilers to reduce communication ...
This paper addresses the problem of compiling nested loops for distributed memory machines. The rela...
PPoPP'12 extended versionInternational audienceSome data- and compute-intensive applications can be ...
International audienceIn the framework of fully permutable loops, tiling is a compiler technique (al...
This paper addresses the problem of compiling perfectly nested loops for multicomputers (distributed...
In the framework of perfect loop nests with uniform dependences, tiling has been extensively studied...
Link to published version: http://ieeexplore.ieee.org/iel2/390/6075/00236705.pdf?tp=&arnumber=236705...