In this paper we describe how we improved the effective performance of ASCI Q, the world's second-fastest supercomputer, to meet our expectations. Using an arsenal of performance-analysis techniques including analytical models, custom microbenchmarks, full applications, and simulators, we succeeded in observing a serious-but previously undetectable-performance problem. We identified the source of the problem, eliminated the problem, and 'closed the loop' by demonstrating improved application performance. We present our methodology and provide insight into performance analysis that is immediately applicable to other large-scale cluster-based supercomputers
The next frontier of high performance computing is the Exascale, and this will certainly stand as a ...
Using an extremely large number of processing elements in computing systems leads to unexpected phen...
Modern supercomputers deliver large computational power, but it is difficult for an application to e...
Performance and scalability of high performance scientific applications on large scale parallel mach...
High Performance Computing (HPC) aims at providing reasonably fast computing solutions to both scien...
High Performance Computing (HPC) aims at providing reasonably fast computing solutions to scientific...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scien...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scie...
Detection, diagnosis and mitigation of performance problems in today\u27s large-scale distributed an...
Nowadays, the whole HPC community is looking forward to the exascale era, with computer and system a...
The basic architectures of vector and parallel computers and their properties are presented followed...
An effective methodology of performance evaluation and improvement enables application developers to...
Data parallel languages are gaining interest as it becomes clear that they support a wider range of ...
There is a growing gap between the peak speed of parallel computing systems and the actual delivered...
The next frontier of high performance computing is the Exascale, and this will certainly stand as a ...
Using an extremely large number of processing elements in computing systems leads to unexpected phen...
Modern supercomputers deliver large computational power, but it is difficult for an application to e...
Performance and scalability of high performance scientific applications on large scale parallel mach...
High Performance Computing (HPC) aims at providing reasonably fast computing solutions to both scien...
High Performance Computing (HPC) aims at providing reasonably fast computing solutions to scientific...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scien...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scie...
Detection, diagnosis and mitigation of performance problems in today\u27s large-scale distributed an...
Nowadays, the whole HPC community is looking forward to the exascale era, with computer and system a...
The basic architectures of vector and parallel computers and their properties are presented followed...
An effective methodology of performance evaluation and improvement enables application developers to...
Data parallel languages are gaining interest as it becomes clear that they support a wider range of ...
There is a growing gap between the peak speed of parallel computing systems and the actual delivered...
The next frontier of high performance computing is the Exascale, and this will certainly stand as a ...
Using an extremely large number of processing elements in computing systems leads to unexpected phen...
Modern supercomputers deliver large computational power, but it is difficult for an application to e...