This paper addresses the problem of exploiting the parallelism available in a program to make efficient use of the target machine's resources, in the context of building a mapping compiler for a distributed-memory parallel machine. It describes how execution models can drive the process of mapping a program onto a particular machine as efficiently as possible. By analyzing the execution models of several mapping techniques for one class of programs, we show that selecting the best technique for a particular program instance can make a significant difference in performance. Furthermore, benchmark results from an implementation of a mapping compiler show that our execution models are accu...
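The idea of letting execution models choose the mapping can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the two candidate mappings (`single_node`, `distributed`), the cost formulas, and the `Instance` and `Machine` parameters are hypothetical and are not taken from the paper. Each model predicts a run time for a program instance, and the selector simply keeps the mapping with the lowest prediction.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass(frozen=True)
class Instance:
    """Hypothetical description of one program instance."""
    work: float       # total computation time on a single processor (seconds)
    msg_bytes: float  # bytes each processor exchanges when the work is distributed
    procs: int        # number of processors available


@dataclass(frozen=True)
class Machine:
    """Hypothetical communication parameters of the target machine."""
    latency: float    # fixed startup cost of one message (seconds)
    per_byte: float   # transfer cost per byte (seconds)


def single_node_model(inst: Instance, m: Machine) -> float:
    # Assumed model: all work on one processor, no communication.
    return inst.work


def distributed_model(inst: Instance, m: Machine) -> float:
    # Assumed model: work split evenly, plus one message exchange per processor.
    return inst.work / inst.procs + m.latency + m.per_byte * inst.msg_bytes


# Illustrative registry of candidate mappings and their execution models.
MODELS: Dict[str, Callable[[Instance, Machine], float]] = {
    "single_node": single_node_model,
    "distributed": distributed_model,
}


def select_mapping(inst: Instance, m: Machine) -> Tuple[str, float]:
    """Return the mapping with the lowest predicted time and that prediction."""
    predictions = {name: model(inst, m) for name, model in MODELS.items()}
    best = min(predictions, key=predictions.get)
    return best, predictions[best]


if __name__ == "__main__":
    machine = Machine(latency=1e-4, per_byte=1e-8)
    for work in (1e-4, 1.0):
        instance = Instance(work=work, msg_bytes=1000, procs=8)
        print(work, select_mapping(instance, machine))
```

Under these assumed parameters, the small instance is predicted to run fastest on a single node because the message startup cost outweighs the parallel speedup, while the large instance favors the distributed mapping. This is the kind of per-instance difference the abstract refers to, although the real models would be calibrated against the target machine.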
In this paper we present a unified approach for compiling programs for Distributed-Memory Multiproce...
In scalable multiprocessor systems, high performance demands that computational load be balanced eve...
To be done later. Keywords: Parallel environment, Distributed-memory machines, Load-balancing, Mapping...
For a wide variety of applications, both task and data parallelism must be exploited to achieve the ...
The need for high-performance computing together with the increasing trend from single processor to ...
Mapping an application program to a parallel architecture can be described as a multidimensional opt...
Many heuristics have been created to solve the mapping problem. A set of mapping heuristics has been...
For a wide variety of applications, both task and data parallelism must be exploited to ach...
The trend from single processor to parallel computer architectures has increased the importance of p...
An efficient parallel program designed for a parallel architecture includes a detailed outline of ac...
This paper presents the ANDES performance evaluation tool. ANDES is based on the synthetic execution...