ABSTRACT
Goal-Directed Performance Tuning for Scientific Applications
by Tien-Pao Shih
Chair: Edward S. Davidson

Performance tuning, as carried out by compiler designers and application programmers to close the gap between achievable peak performance and delivered performance, becomes increasingly important and challenging as microprocessor speeds and system sizes increase. Although performance tuning of scientific codes usually deals with relatively small program regions, it is not generally known how to establish a reasonable performance objective or how to achieve that objective efficiently. We suggest a goal-directed approach and develop such an approach for each of three major system performance components: central pr...
Modern supercomputers deliver large computational power, but it is difficult for an application to e...
The technological improvements in silicon manufacturing are yielding vast increases of processor &ap...
We have developed a hierarchical performance bounding methodology that attempts to explain the perfo...
Performance tuning, as carried out by compiler designers and application programmers to close the pe...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
We have developed a performance bounding methodology that explains the performance of loop-dominated...
Tuning the performance of applications requires understanding the interactions between code and targ...
Application performance on modern microprocessors depends heavily on performance related characteris...
An effective methodology of performance evaluation and improvement enables application developers to...
Measurements of actual supercomputer cache performance have not been previously undertaken. PFC-Sim i...
We present a cache performance modeling methodology that facilitates the tuning of uniprocessor cach...
Obtaining high performance without machine-specific tuning is an important goal of scientific applic...
The recent transformation from an environment where gains in computational performance came from inc...
Workload characterization has proven to be an essential tool for architecture design and performance e...