The performance of a Cray system depends heavily on how well individual users tune their codes. Many of our users were not taking advantage of the tuning tools that let them monitor their own programs with the Hardware Performance Monitor (HPM). We therefore modified UNICOS to collect HPM data for all processes and to report Mflop ratings by user, by program, and by time used. Our tuning efforts now focus on the users and programs with the greatest potential for performance improvement. These modifications and some of the more striking performance improvements are described.
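As a rough illustration of the kind of report described above, the following minimal sketch aggregates per-process samples into Mflop ratings per user and program. It is not the UNICOS modification itself: the record layout `(user, program, flops, cpu_seconds)`, the function name `mflop_report`, and the sample values are all assumptions made for this example.

```python
from collections import defaultdict


def mflop_report(records):
    """Aggregate per-process samples into per-(user, program) Mflop ratings.

    Each record is a hypothetical tuple: (user, program, flops, cpu_seconds).
    """
    totals = defaultdict(lambda: [0.0, 0.0])  # key -> [flops, cpu_seconds]
    for user, program, flops, cpu_seconds in records:
        totals[(user, program)][0] += flops
        totals[(user, program)][1] += cpu_seconds

    report = []
    for (user, program), (flops, secs) in totals.items():
        # Mflop rating = floating-point operations per CPU second, in millions.
        rate = flops / secs / 1e6 if secs > 0 else 0.0
        report.append((user, program, secs, rate))

    # Sort by CPU time used, largest first: heavy consumers with low
    # ratings are the most promising tuning candidates.
    report.sort(key=lambda row: row[2], reverse=True)
    return report


# Made-up sample data, for illustration only.
sample = [
    ("alice", "cfd", 6.0e9, 100.0),
    ("alice", "cfd", 3.0e9, 50.0),
    ("bob", "fft", 2.0e9, 10.0),
]
REPORT = mflop_report(sample)
for user, program, secs, rate in REPORT:
    print(f"{user:8s} {program:8s} {secs:10.1f} s {rate:8.1f} Mflops")
```

Sorting by CPU time rather than by rating reflects the idea in the abstract that tuning effort should go where the potential gain is largest; a real report might weight the two differently.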
This paper presents an automatic counter instrumentation and profiling module added to the MPI librar...
Recent microprocessor advances have significantly improved the capabilities of on-chip performance m...
One of the reasons why parallel programming is considered to be a difficult task is that users frequ...
A Performance Analysis Tools (PAT) report implementing hpm and perftrace software, installed under ...
TOPAS is a tool to automatically and transparently monitor usage and performance of every parallel j...
CPU clock frequency is not likely to be increased significantly in the coming years, and data analys...
High-performance computing (HPC) systems with hardware-reconfigurable devices have the potential to ...
To be able to improve the performance of your system you need a prior understanding of what can be ...
We describe our experiences in repeated cycles of performance optimization, benchmarking, and perfor...
Microprocessors continue to make great str...
We have developed an environment, based upon robust, existing, open source software, for tuning appl...
Application performance tuning is a complex process that requires assembling various types of inform...
Performance measurement and runtime tuning tools are both vital in the HPC software ecosystem and us...
Performance observability is the ability to accurately capture, analyze, and present (collectively o...
This thesis presents a new measurement methodology especially designed to improve the performance of...