We analyse the inherent performance of parallel software. To this end we use a task graph to model software structure, and apply measures of performance based on the graph's execution time, given enough processors. The task graph consists of tasks and precedence constraints between tasks; we show that performance cannot be improved by adding constraints to a task graph. Using this result, we derive known bounds on performance in a systematic manner by comparing the structure of a program to benchmark structures for which performance can be easily determined. Further, we simplify these bounds to highlight the importance of some summary parameters of the precedence structure. To conclude, we examine an existing model of families of programs. We find ...
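The claim that adding constraints cannot improve performance can be illustrated with a small sketch. It assumes that "execution time, given enough processors" means the length of the longest (critical) path through the task graph, with each task starting as soon as all its predecessors have finished. The function name `execution_time`, the task names, and the durations are hypothetical and chosen only for illustration; they do not come from the paper.

```python
from graphlib import TopologicalSorter


def execution_time(durations, precedes):
    """Execution time with unlimited processors (assumed: critical-path length).

    durations: dict mapping task name -> duration
    precedes:  iterable of (before, after) precedence constraints
    """
    # Map each task to the set of tasks that must finish before it starts.
    preds = {task: set() for task in durations}
    for before, after in precedes:
        preds[after].add(before)

    finish = {}
    # static_order() yields every task after all of its predecessors.
    for task in TopologicalSorter(preds).static_order():
        start = max((finish[p] for p in preds[task]), default=0)
        finish[task] = start + durations[task]
    return max(finish.values(), default=0)


# Hypothetical four-task graph.
durations = {"a": 2, "b": 3, "c": 1, "d": 4}
base = [("a", "c"), ("b", "c"), ("a", "d")]
# The same graph with one extra precedence constraint, b before d.
extra = base + [("b", "d")]

t_base = execution_time(durations, base)
t_extra = execution_time(durations, extra)
print(t_base, t_extra)
```

In this example the extra constraint delays task `d`, so the constrained graph takes at least as long as the original. Every path in the original graph is still a path in the constrained one, so the longest path can only grow or stay the same.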
Many parallel algorithms can be modelled as directed acyclic task graphs. Recently, Degree of Simult...
The performance of a computer system is important. One way of improving performance is to use multip...
This thesis consists of two parts: performance bounds for scheduling algorithms for parallel program...
When a parallel computation is represented in a formalism that imposes series-parallel structure on ...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
Standards bodies and commercial software vendors have defined parallel constructs to harness the par...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
A well-organized parallel application can achieve better performance than sequential e...
A parallel program can be represented as a directed acyclic graph. An important performance bound i...
An effective methodology of performance evaluation and improvement enables application developers to...
In this paper, we describe a model for determining the optimal data and computation decomposition fo...
Over the past 10 years we have seen the transition from single-core to multicore computing,...
We have developed a hierarchical performance bounding methodology that attempts to explain the perfo...
In this paper we estimate parallel execution times, based on identifying separate “parts”...