Future supercomputers will require application developers to expose much more parallelism than current applications expose. In order to assist application developers in structuring their applications such that this is possible, new programming models and libraries are emerging, the many-task runtimes, to allow for the expression of orders of magnitude more parallelism than currently existing models. This dissertation describes the challenges that these emerging many-task runtimes will place on performance analysis, and proposes deep integration between runtimes and performance tools as a means of producing correct, insightful, and actionable performance results. I show how tool-runtime integration can be used to aid programmer understand...
High Performance Computing (HPC) has always been a key foundation for scientific simulation and disc...
Advances in processors architecture, such as multicore, increase the size of complexity of parallel ...
Achieving a significant fraction of peak performance on a modern high-performance computer is a chal...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
Parallel task-based programming models like OpenMP support the declaration of task data dependences....
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
OpenMP is a popular application programming interface (API) used to write shared-memory parallel pro...
Parallelism has become more and more commonplace with the advent of the multicore processors. Althou...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
The purpose of this project was to build an extensible cross-platform infrastructure to facilitate t...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
peer-reviewedIn recent years there has been a shift in microprocessor manufacture from building sing...
HPC application developers encounter significant challenges getting their codes to run correctly on ...
Performance is the critical feature in the design and productivity of software systems. A key to imp...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
High Performance Computing (HPC) has always been a key foundation for scientific simulation and disc...
Advances in processors architecture, such as multicore, increase the size of complexity of parallel ...
Achieving a significant fraction of peak performance on a modern high-performance computer is a chal...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
Parallel task-based programming models like OpenMP support the declaration of task data dependences....
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
OpenMP is a popular application programming interface (API) used to write shared-memory parallel pro...
Parallelism has become more and more commonplace with the advent of the multicore processors. Althou...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
The purpose of this project was to build an extensible cross-platform infrastructure to facilitate t...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
peer-reviewedIn recent years there has been a shift in microprocessor manufacture from building sing...
HPC application developers encounter significant challenges getting their codes to run correctly on ...
Performance is the critical feature in the design and productivity of software systems. A key to imp...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
High Performance Computing (HPC) has always been a key foundation for scientific simulation and disc...
Advances in processors architecture, such as multicore, increase the size of complexity of parallel ...
Achieving a significant fraction of peak performance on a modern high-performance computer is a chal...