Data-driven array architectures seem to be important alternatives for coarse-grained reconfigurable computing platforms. Their use has provided performance improvements over microprocessors and shorter programming cycles than FPGA-based platforms. As with other architectures, in data-driven architectures loop pipelining plays an important role to improve performance. Usually this kind of pipelining can be achieved using the dataflow software pipelining technique or other software pipelining approaches. Although performance improvements are achieved, those techniques heavily depend on the insertion of pipelining stages and thus require complex balancing efforts. Furthermore, those techniques statically define the pipelining and do not take f...
This paper discusses software pipelining for a new class of ar-chitectures that we call transport-tr...
Pipelining is an important technique in high-level synthesis, which overlaps the execution of succes...
Pipelining algorithms are typically concerned with improving only the steady-state performance, or t...
Sequences of data-dependent tasks, each one traversing large data sets, exist in many applications (...
Traditionally, software pipelining is applied either to the innermost loop of a given loop nest or f...
We present a technique for increasing the throughput of stream process-ing architectures by removing...
Traditionally, software pipelining is applied either to the innermost loop of a given loop nest or f...
In this paper, we propose a compiler method for software pipelining of loop nests on multi-core chip...
The term "dataflow" generally encompasses three distinct aspects of computation - a data-driven mode...
Pipelining is a well-known technique that enables parallel execution of loops with cross-iteration d...
[[abstract]]A systematic procedure for designing pipelined data-parallel algorithms that are suitabl...
Traditionally, software pipelining is applied either to the innermost loop of a given loop nest or f...
Dataflow machines can "unravel" loops automatically so that many iterations of a loop can execute i...
Developing efficient programs for many of the current parallel computers is not easy due to the arch...
Pipeline parallelism organizes a parallel program as a linear sequence of stages. Each stage process...
This paper discusses software pipelining for a new class of ar-chitectures that we call transport-tr...
Pipelining is an important technique in high-level synthesis, which overlaps the execution of succes...
Pipelining algorithms are typically concerned with improving only the steady-state performance, or t...
Sequences of data-dependent tasks, each one traversing large data sets, exist in many applications (...
Traditionally, software pipelining is applied either to the innermost loop of a given loop nest or f...
We present a technique for increasing the throughput of stream process-ing architectures by removing...
Traditionally, software pipelining is applied either to the innermost loop of a given loop nest or f...
In this paper, we propose a compiler method for software pipelining of loop nests on multi-core chip...
The term "dataflow" generally encompasses three distinct aspects of computation - a data-driven mode...
Pipelining is a well-known technique that enables parallel execution of loops with cross-iteration d...
[[abstract]]A systematic procedure for designing pipelined data-parallel algorithms that are suitabl...
Traditionally, software pipelining is applied either to the innermost loop of a given loop nest or f...
Dataflow machines can "unravel" loops automatically so that many iterations of a loop can execute i...
Developing efficient programs for many of the current parallel computers is not easy due to the arch...
Pipeline parallelism organizes a parallel program as a linear sequence of stages. Each stage process...
This paper discusses software pipelining for a new class of ar-chitectures that we call transport-tr...
Pipelining is an important technique in high-level synthesis, which overlaps the execution of succes...
Pipelining algorithms are typically concerned with improving only the steady-state performance, or t...