Pipelining algorithms are typically concerned with improving only the steady-state performance, or the kernel time. The pipeline setup time happens only once and therefore can be negligible compared to the kernel time. However, for Coarse-Grained Reconfigurable Architectures (CGRAs) used as a coprocessor to a main processor, pipeline setup can take much longer due to the communication delay between the two processors, and can become significant if it is repeated in an outer loop of a loop nest. In this paper we evaluate the overhead of such non-kernel execution times when mapping nested loops for CGRAs, and propose a novel architecture-compiler cooperative scheme to reduce the overhead, while also minimizing the number of extra configuratio...
Reconfigurable systems have drawn increasing attention from both academic researchers and creators o...
With the increasing demand for flexible yet highly efficient architecture platforms for media applic...
International audienceResearch interest and industry investment in edge computing solutions have inc...
Nested loops represent a significant portion of application runtime in multimedia and DSP applicatio...
Thesis (Ph.D.)--University of Washington, 2017-06This dissertation presents an execution model and c...
For loop accelerators such as coarse-grained reconfigurable architectures (CGRAs) and GP-GPUs, neste...
Abstract Coarse-Grained Reconfigurable Array (CGRA) architectures accelerate the same inner loops th...
Coarse-Grained Reconfigurable Array (CGRA) processors accelerate inner loops of applications by expl...
Coarse-Grained Reconfigurable Array (CGRA) architectures accelerate the same inner loops that benefit ...
Abstract—OpenCL is undoubtedly becoming one of the most popular parallel programming languages as it...
Due to their flexibility and high performance, Coarse Grained Reconfigurable Array (CGRA) are a topi...
Coarse-Grained Reconfigurable Architectures (CGRAs), often used as coprocessors for DSP and multimed...
Coarse-grained reconfigurable arrays (CGRAs) are a very promising platform, providing both up to 10-...
Parallelizing compilers promise to exploit the parallelism available in a given program, particularl...
Control divergence poses many problems in parallelizing loops. While predicated execution is commonl...
Reconfigurable systems have drawn increasing attention from both academic researchers and creators o...
With the increasing demand for flexible yet highly efficient architecture platforms for media applic...
International audienceResearch interest and industry investment in edge computing solutions have inc...
Nested loops represent a significant portion of application runtime in multimedia and DSP applicatio...
Thesis (Ph.D.)--University of Washington, 2017-06This dissertation presents an execution model and c...
For loop accelerators such as coarse-grained reconfigurable architectures (CGRAs) and GP-GPUs, neste...
Abstract Coarse-Grained Reconfigurable Array (CGRA) architectures accelerate the same inner loops th...
Coarse-Grained Reconfigurable Array (CGRA) processors accelerate inner loops of applications by expl...
Coarse-Grained Reconfigurable Array (CGRA) architectures accelerate the same inner loops that benefit ...
Abstract—OpenCL is undoubtedly becoming one of the most popular parallel programming languages as it...
Due to their flexibility and high performance, Coarse Grained Reconfigurable Array (CGRA) are a topi...
Coarse-Grained Reconfigurable Architectures (CGRAs), often used as coprocessors for DSP and multimed...
Coarse-grained reconfigurable arrays (CGRAs) are a very promising platform, providing both up to 10-...
Parallelizing compilers promise to exploit the parallelism available in a given program, particularl...
Control divergence poses many problems in parallelizing loops. While predicated execution is commonl...
Reconfigurable systems have drawn increasing attention from both academic researchers and creators o...
With the increasing demand for flexible yet highly efficient architecture platforms for media applic...
International audienceResearch interest and industry investment in edge computing solutions have inc...