Predication is an essential technique to accelerate kernels with control flow on CGRAs. While state-based full predication (SFP) can remove wasteful power consumption on issuing/decoding instructions from conventional full predication, generating code for SFP is challenging for general CGRAs, especially when there are multiple conditionals to be handled due to exploiting data level parallelism. In this paper, we present a novel compiler framework addressing central issues such as how to express the parallelism between multiple conditionals, and how to allocate resources to them to maximize the parallelism. In particular, by separating the handling of control flow and data flow, our framework can be integrated with conventional mapping algor...
Coarse-Grained Reconfigurable Architectures(CGRAs) can be employed for accelerating computational wo...
abstract: Coarse Grain Reconfigurable Arrays (CGRAs) are promising accelerators capable of achievin...
The herein presented research is motivated by the need for reconfigurable, flexible computing arrays...
Control divergence poses many problems in parallelizing loops. While predicated execution is commonl...
The automatic parallelization of loops that contain complex computations is still a challenge for cu...
International audienceIn the approaching era of IoT, flexible and low power accelerators have become...
Irregular control-flow structures like deeply nested conditional branches are common in real-world s...
Coarse-Grained Reconfigurable Array (CGRA) processors accelerate inner loops of applications by expl...
Predicated execution is a promising architectural feature for exploiting instruction-level paralleli...
This paper presents a new approach for the detection of coarse-grain parallelism in loop nests that ...
Abstract—Data-parallel architectures must provide efficient support for complex control-flow constru...
[Abstract] Summary form only given. The automatic parallelization of loops that contain complex comp...
In this paper, we present a compilation flow for HPC kernels on the REDEFINE coarse-grain reconfigur...
In the approaching era of IoT, flexible and low power accelerators have become essential to meet agg...
For loop accelerators such as coarse-grained reconfigurable architectures (CGRAs) and GP-GPUs, neste...
Coarse-Grained Reconfigurable Architectures(CGRAs) can be employed for accelerating computational wo...
abstract: Coarse Grain Reconfigurable Arrays (CGRAs) are promising accelerators capable of achievin...
The herein presented research is motivated by the need for reconfigurable, flexible computing arrays...
Control divergence poses many problems in parallelizing loops. While predicated execution is commonl...
The automatic parallelization of loops that contain complex computations is still a challenge for cu...
International audienceIn the approaching era of IoT, flexible and low power accelerators have become...
Irregular control-flow structures like deeply nested conditional branches are common in real-world s...
Coarse-Grained Reconfigurable Array (CGRA) processors accelerate inner loops of applications by expl...
Predicated execution is a promising architectural feature for exploiting instruction-level paralleli...
This paper presents a new approach for the detection of coarse-grain parallelism in loop nests that ...
Abstract—Data-parallel architectures must provide efficient support for complex control-flow constru...
[Abstract] Summary form only given. The automatic parallelization of loops that contain complex comp...
In this paper, we present a compilation flow for HPC kernels on the REDEFINE coarse-grain reconfigur...
In the approaching era of IoT, flexible and low power accelerators have become essential to meet agg...
For loop accelerators such as coarse-grained reconfigurable architectures (CGRAs) and GP-GPUs, neste...
Coarse-Grained Reconfigurable Architectures(CGRAs) can be employed for accelerating computational wo...
abstract: Coarse Grain Reconfigurable Arrays (CGRAs) are promising accelerators capable of achievin...
The herein presented research is motivated by the need for reconfigurable, flexible computing arrays...