Abstract: In this paper we present a theoretical model for determining the synchronization frequency that minimizes the parallel execution time of loops with uniform dependencies dynamically scheduled on heterogeneous systems. Using this model, we compute the synchronization frequency that minimizes the estimated parallel time. The accuracy of the method is validated through experiments on a heterogeneous cluster. The results show that the synchronization frequency our method predicts to minimize the parallel time is very close to the one found experimentally.
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...
Loops that synchronize parallel processors at the end of each iteration are compared with loops that...
Parallel programming is an intellectually demanding task. One of the most difficult challenges in th...
Developing efficient programs for distributed systems is difficult because computations must be effi...
on heterogeneous systems [3] [4] • Problem: determining the optimal synchronization frequency betwee...
For bulk synchronous computations that have nondeterministic behaviors, dynamic remapping is an effe...
We present two algorithms to minimize the amount of synchronization added when parallelizing a loop ...
As the complexity of parallel computers grows, constraints posed by the construction of larger syste...
There has been a recent increase of interest in heterogeneous computing systems, due partly to the f...
For bulk synchronous computations that have non-deterministic behaviors, dynamic remapping is an eff...
This paper presents a pragmatic algorithm to build a global time on any distributed system, which is ...
In this paper, we study the problem of scheduling parallel loops at compile-time for a heterogeneous...
A methodology is introduced for minimizing the total execution time for a class of large-scale paral...
This paper addresses the problem of load balancing data-parallel computations on heterogeneous and t...
Developers of scalable libraries and applications for distributed-memory parallel systems face many ...