Efficient performance tuning of parallel programs is often hard. In this paper we describe an approach that uses a uni-processor execution of a multithreaded pro-gram as reference to simulate a multiprocessor execution. The speed-up is predicted, and the program behaviour is visualized as a graph, which can be used in the perfor-mance tuning process. The simulator considers scheduling as well as hard-ware parameters, e.g., the thread priority, no. of LWPs, and no. of CPUs. The visualization part shows the simu-lated execution in two graphs: one showing the threads’ behaviour over time and the other the amount of parallel-ism over time. In the first graph is it possible to relate an event in the graph to the code line causing the event. Vali...
[[abstract]]©1988 North-Holland-The authors outline an approach to the design of a set of interactiv...
The combination of low cost clusters and multicore processors lowers the barrier for acces-sing mass...
International audienceFinely tuning MPI applications (number of processes, granularity, collectiveop...
Efficient performance tuning of parallel programs is often hard. In this paper we describe an approa...
Context. Today’s parallel systems are widely used in different computational tasks. Developing paral...
Performance tuning of a parallel application is often hard. The use of standards, such as POSIX thre...
The performance of a computer system is important. One way of improving performance is to use multip...
Introduction In general, a parallel computer is a computer that has multiple processors connected b...
The use of multiprocessors is an important way to increase the performance of a supercom-puting pr...
[[abstract]]In an environment for developing and running parallel programs, it is not always possibl...
The primary motivation behind building multiprocessors is to cost-effectively improve system perform...
Introduction In general, a parallel computer is a computer that has multiple processors connected b...
Designing parallel programs for message-passing systems is not an easy task. Difficulties arise larg...
The performance of an applications program running on a parallel machine is affected by several fact...
The simulation of parallel systems is an alternative approach to classical parallel system programmi...
[[abstract]]©1988 North-Holland-The authors outline an approach to the design of a set of interactiv...
The combination of low cost clusters and multicore processors lowers the barrier for acces-sing mass...
International audienceFinely tuning MPI applications (number of processes, granularity, collectiveop...
Efficient performance tuning of parallel programs is often hard. In this paper we describe an approa...
Context. Today’s parallel systems are widely used in different computational tasks. Developing paral...
Performance tuning of a parallel application is often hard. The use of standards, such as POSIX thre...
The performance of a computer system is important. One way of improving performance is to use multip...
Introduction In general, a parallel computer is a computer that has multiple processors connected b...
The use of multiprocessors is an important way to increase the performance of a supercom-puting pr...
[[abstract]]In an environment for developing and running parallel programs, it is not always possibl...
The primary motivation behind building multiprocessors is to cost-effectively improve system perform...
Introduction In general, a parallel computer is a computer that has multiple processors connected b...
Designing parallel programs for message-passing systems is not an easy task. Difficulties arise larg...
The performance of an applications program running on a parallel machine is affected by several fact...
The simulation of parallel systems is an alternative approach to classical parallel system programmi...
[[abstract]]©1988 North-Holland-The authors outline an approach to the design of a set of interactiv...
The combination of low cost clusters and multicore processors lowers the barrier for acces-sing mass...
International audienceFinely tuning MPI applications (number of processes, granularity, collectiveop...