FPGA-based token dataflow processing has been shown to accelerate hard-to-parallelize problems exhibiting irregular dataflow parallelism by as much as an order of magnitude when compared to conventional compute organizations. However, when the structure of the dataflow computation is known upfront, either at compile time or at the start of execution, we can employ static scheduling techniques to further improve performance and enhance compute density of the dataflow hardware. In this paper, we identify the costs and performance trends of both static and dynamic scheduling approaches when considering hardware acceleration of SPICE device equations and Sparse LU factorization in circuit graphs. While the experiments are limited to a case stud...
Parallel graph-oriented applications expressed in the Bulk-Synchronous Parallel (BSP) and Token Data...
Dataflow machines can "unravel" loops automatically so that many iterations of a loop can execute i...
Developing and fine-tuning software programs for heterogeneous hardware such as CPU/GPU processing p...
FPGA-based token dataflow architectures with heterogeneous computation and communication subsystems ...
This paper describes and analyzes a paradigm for scheduling com-putations on a network of multiproce...
The ambitious challenges posed by next exascale computing systems may require a critical re-examinat...
We describe a new intermediate compiler representation, static token form, that is suitable for data...
Dataflow Coprocessor Overlay (DaCO) is an FPGA-tuned dataflow-driven overlay architecture that offer...
The path towards future high performance computers requires architectures able to efficiently run mu...
Abstract—The ambitious challenges posed by next exascale computing systems may require a critical re...
Abstract—The path towards future high performance comput-ers requires architectures able to efficien...
Abstract—Heterogeneous computing using FPGA accelerators is a promising approach to boost the perfor...
In this paper the Scheduled Dataflow (SDF) architecture - a decoupled memory/execution, multithreade...
This paper proposes a new high-level approach for optimising field programmable gate array (FPGA) de...
textMany digital signal processing and real-time streaming systems are modeled using dataflow graphs...
Parallel graph-oriented applications expressed in the Bulk-Synchronous Parallel (BSP) and Token Data...
Dataflow machines can "unravel" loops automatically so that many iterations of a loop can execute i...
Developing and fine-tuning software programs for heterogeneous hardware such as CPU/GPU processing p...
FPGA-based token dataflow architectures with heterogeneous computation and communication subsystems ...
This paper describes and analyzes a paradigm for scheduling com-putations on a network of multiproce...
The ambitious challenges posed by next exascale computing systems may require a critical re-examinat...
We describe a new intermediate compiler representation, static token form, that is suitable for data...
Dataflow Coprocessor Overlay (DaCO) is an FPGA-tuned dataflow-driven overlay architecture that offer...
The path towards future high performance computers requires architectures able to efficiently run mu...
Abstract—The ambitious challenges posed by next exascale computing systems may require a critical re...
Abstract—The path towards future high performance comput-ers requires architectures able to efficien...
Abstract—Heterogeneous computing using FPGA accelerators is a promising approach to boost the perfor...
In this paper the Scheduled Dataflow (SDF) architecture - a decoupled memory/execution, multithreade...
This paper proposes a new high-level approach for optimising field programmable gate array (FPGA) de...
textMany digital signal processing and real-time streaming systems are modeled using dataflow graphs...
Parallel graph-oriented applications expressed in the Bulk-Synchronous Parallel (BSP) and Token Data...
Dataflow machines can "unravel" loops automatically so that many iterations of a loop can execute i...
Developing and fine-tuning software programs for heterogeneous hardware such as CPU/GPU processing p...