In this contribution we present an optimised method for mapping of data-flow graphs onto parallel processor networks. Its main advantage is the exact consideration of overhead times for data exchange within the network. Therefore, the well known Dynamic Level Scheduler has been extended by a software router which is capable of calculating communication costs for all commonly used network topologies. This task is based on a special minimal routing algorithm for reconfigurable processor networks. Comparisons between the original and our modified scheduling algorithm will show significant reduction of makespan. A possible integration of the algorithm in software environments for parallel processing will be demonstrated by means of the framewor...
Data parallel programming provides a simple and powerful framework for designing parallel algorithms...
Four scheduling strategies for dataflow graphs onto parallel processors are classified: (1) fully dy...
In a multicomputer network, each task is decomposed into modules which are then executed by a number...
; p j ) the earliest time, when all input data for vertex v i are available at processor p j . The a...
Many computation-intensive or recursive applications commonly found in digital signal processing and...
In this paper, we survey algorithms that allocate a parallel program represented by an edge-weighted...
textMany digital signal processing and real-time streaming systems are modeled using dataflow graphs...
Abstract 1 In this paper, we survey algorithms that allocate a parallel program represented by an ed...
Numerous customary applications in digital signal processing may be characterized by synchronous dat...
In order to accelerate the execution of streaming applications on multi-core systems, this article s...
Applications such as image processing, fluid mechanics, and geophysical data analysis are examples o...
The objective of this research is to propose a low-complexity static scheduling and allocation algor...
The objective of this research is to propose a lowcomplexity static scheduling and allocation algori...
AbstractA model for parallel and distributed programs, the dynamic process graph (DPG), is investiga...
This paper explores the problem of efficiently ordering inter-processor communication operations in ...
Data parallel programming provides a simple and powerful framework for designing parallel algorithms...
Four scheduling strategies for dataflow graphs onto parallel processors are classified: (1) fully dy...
In a multicomputer network, each task is decomposed into modules which are then executed by a number...
; p j ) the earliest time, when all input data for vertex v i are available at processor p j . The a...
Many computation-intensive or recursive applications commonly found in digital signal processing and...
In this paper, we survey algorithms that allocate a parallel program represented by an edge-weighted...
textMany digital signal processing and real-time streaming systems are modeled using dataflow graphs...
Abstract 1 In this paper, we survey algorithms that allocate a parallel program represented by an ed...
Numerous customary applications in digital signal processing may be characterized by synchronous dat...
In order to accelerate the execution of streaming applications on multi-core systems, this article s...
Applications such as image processing, fluid mechanics, and geophysical data analysis are examples o...
The objective of this research is to propose a low-complexity static scheduling and allocation algor...
The objective of this research is to propose a lowcomplexity static scheduling and allocation algori...
AbstractA model for parallel and distributed programs, the dynamic process graph (DPG), is investiga...
This paper explores the problem of efficiently ordering inter-processor communication operations in ...
Data parallel programming provides a simple and powerful framework for designing parallel algorithms...
Four scheduling strategies for dataflow graphs onto parallel processors are classified: (1) fully dy...
In a multicomputer network, each task is decomposed into modules which are then executed by a number...