Performance analysis tools are essential to the maintenance of efficient parallel execution of scientific applications. As scientific applications are executed on larger and larger parallel supercomputers, it is clear that performance tools must employ more advanced techniques to keep up with the increasing data volume and complexity of the performance information generated by these applications as a result of scaling. In this thesis, we investigate the useful techniques in four main thrusts to address various aspects of this problem. First, we study how some traditional performance analysis idioms can break down in the face of data from large processor counts and demonstrate techniques and tools that restore scalability. Second, we inv...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...
This paper presents scalability as a basis for profiling and performance debugging of parallel progr...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scie...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scien...
High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To ...
Achieving a significant fraction of peak performance on a modern high-performance computer is a chal...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
Concurrency levels in large-scale, distributed-memory supercomputers are rising exponentially. Moder...
The scalability of performance tools in high performance computing has been lagging behind the growt...
With larger and larger systems being constantly deployed, trace-based performance analysis of paral...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
Modern parallel systems and applications are constantly increasing in scale and complexity, and cons...
As access to supercomputing resources is becoming more and more commonplace, performance analysis to...
Concurrency levels in large-scale, distributed-memory supercomputers are rising exponentially. Moder...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...
This paper presents scalability as a basis for profiling and performance debugging of parallel progr...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scie...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scien...
High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To ...
Achieving a significant fraction of peak performance on a modern high-performance computer is a chal...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
Concurrency levels in large-scale, distributed-memory supercomputers are rising exponentially. Moder...
The scalability of performance tools in high performance computing has been lagging behind the growt...
With larger and larger systems being constantly deployed, trace-based performance analysis of paral...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
Modern parallel systems and applications are constantly increasing in scale and complexity, and cons...
As access to supercomputing resources is becoming more and more commonplace, performance analysis to...
Concurrency levels in large-scale, distributed-memory supercomputers are rising exponentially. Moder...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...
This paper presents scalability as a basis for profiling and performance debugging of parallel progr...
Most performance debugging and tuning of parallel programs is based on the "measure-modify"...