Abstract. Current analytical performance prediction models try to characterize the performance behavior of actual machines through a small set of parameters. Due to various factors, the predicted times suffer substantial deviations. A natural approach is to associate a different proportionality constant with each basic block of computation. In particular, the paper deals with a skeleton designed for parallel divide and conquer algorithms that provide hypercubical communications among processes. Our proposal is to introduce different kinds of components to the analytical model by associating a performance constant with each conceptual block of a skeleton. The trace files obtained from the execution of the resulting code using the program...
To continuously comply with Moore's Law, modern parallel machines become increasingly complex. Effec...
An approach to the characterisation of parallel systems using a structured layered methodology is de...
The performance skeleton of an application is a short running program whose performance in any scena...
Current performance prediction analytical models try to characterize the performance behavior of act...
Parallel divide and conquer computations, encompassing a wide variety of applications, can be modele...
The performance skeleton of an application is a short running program whose performance in any scena...
Abstract. In this paper we estimate parallel execution times, based on identifying separate “parts”...
Divide-and-conquer algorithms obtain the solution to a problem by recursively dividing it into subpr...
Abstract. We show in this paper how to evaluate the performance of skeleton-based high level paralle...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
Abstract. We show in this paper how to evaluate the performance of pipeline-structured parallel prog...
The design of high-performance computing architectures requires performance analysis of large-scale ...
The increase in the use of parallel distributed architectures in order to solve large-scale scienti...
To continuously comply with Moore's Law, modern parallel machines become increasingly complex. Effec...
Performance modeling plays a significant role in predicting the effects of a particular design choic...