In this paper, the problem of evaluating the performance of parallel programs generated by data parallel compilers is studied. These compilers take as input an application written in a sequential language augmented with data distribution directives and produce a parallel version based on the specified partitioning of data. A methodology for evaluating the relationships existing among the program characteristics, the data distribution adopted, and the performance indices measured during the program execution is described. It consists of three phases: a "static" description of the program under study, a "dynamic" description, based on the measurement and the analysis of its execution on a real system, and the construction...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
A well-organized parallel application can achieve better performance than sequential e...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
In this paper, we present the overall design of Pandore II, an Environment ded...
The area of parallelizing compilers for distributed memory multicomputers has seen considerable rese...
Distributed processing has been widely used to improve the performance of applications with...
In the above report, the use of statistical methods to predict the efficiency of the parallel a...
This paper discusses the development of a portable suite of benchmarking programs for parallel comp...
Although parallel computers have existed for many years, recently there has been a surge of academic...
Because of physical limits, hardware designers have switched to parallel systems to exploit ...
In this paper, we prove that the data-driven parallelization technique, which compiles sequential pr...
This paper presents a comparative and qualitative survey of techniques for evaluating parallel sys...
In the data parallel programming style the user usually specifies the data parallelism explici...
The difficulty of programming distributed memory parallel architectures is an impediment to the expl...