This paper presents a framework based on an user driven methodology to obtain analytical models on parallel systems and, in particular, clusters. This framework consists of two intercon-nected stages. In the first one, the analyst instruments the source code and some performance parameters are monitored. In the second one, the monitored data are used to obtain an ana-lytical model using statistical processes. The main functionalities added to the analysis stage include an automatic fit process that provides accurate performance models and the automatic data collection from monitoring. Some examples are used to show the automatic fit process. The accuracy of the models is compared with a complexity study of the selected examples.
Abstract — A parallel program should be evaluated to determine its efficiency, accuracy and benefits...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
A new approach to monitoring the runtime behaviour of parallel programs will be presented. Our appro...
In the above raport the usage of the statistical methods to predict the efficiency of the parallel a...
Measuring the performance of parallel codes is a compromise between lots of factors. The most import...
Fundamental to the development and use of parallel and distributed systems is the ability to observe...
Although there are many situations in which a model of application performance is valuable, performa...
Building parameterized performance models of applications in an automatic way is difficult because o...
High Performance Computing and Supercomputing is the high end area of the computing science that stu...
We propose a new model for parallel speedup that is based on two parameters, the average parallelism...
When implementing parallel programs for parallel computer systems the performance scalability of the...
Many parallel applications suffer from latent performance limitations that may prevent them from sca...
Parallel and distributed programming constitutes a highly promising approach to improving the perfor...
The evolution of parallel and distributed architectures and programming paradigms for performance-or...
Abstract — A parallel program should be evaluated to determine its efficiency, accuracy and benefits...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
A new approach to monitoring the runtime behaviour of parallel programs will be presented. Our appro...
In the above raport the usage of the statistical methods to predict the efficiency of the parallel a...
Measuring the performance of parallel codes is a compromise between lots of factors. The most import...
Fundamental to the development and use of parallel and distributed systems is the ability to observe...
Although there are many situations in which a model of application performance is valuable, performa...
Building parameterized performance models of applications in an automatic way is difficult because o...
High Performance Computing and Supercomputing is the high end area of the computing science that stu...
We propose a new model for parallel speedup that is based on two parameters, the average parallelism...
When implementing parallel programs for parallel computer systems the performance scalability of the...
Many parallel applications suffer from latent performance limitations that may prevent them from sca...
Parallel and distributed programming constitutes a highly promising approach to improving the perfor...
The evolution of parallel and distributed architectures and programming paradigms for performance-or...
Abstract — A parallel program should be evaluated to determine its efficiency, accuracy and benefits...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...