Application performance models are important for both software and hardware development. They can be used to understand and improve application performance, to determine what architectural features are important to a particular program component, or to guide the design of new architectures. Creating accurate performance models of most computations typically requires significant expertise, human effort, and computational resources. Moreover, even when performed by experts, it is necessarily limited in scope, accuracy, or both. This research considers a number of novel static program analysis techniques to create performance-related program representations of high-performance computations. These program representations can be used to model pe...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
This thesis presents a generic approach towards compiling fast execution-driven simulators, and appl...
Developing programs that fully utilize the available computing capabilities of the underlying hardwa...
Abstract. Performance understanding and prediction are extremely im-portant goals for guiding the ap...
Cavazos, JohnIt has been shown that machine-learning driven optimizations often outperform bundled o...
Projecting performance of applications and hardware is important to several market segments—hardware...
Tuning the performance of applications requires understanding the interactions between code and targ...
Developing an optimizing compiler for a newly proposed architecture is extremely difficult when ther...
Applying the right compiler optimizations to a particular program can have a significant impact on p...
To analyze the performance of applications and architectures, both programmers and architects desire...
This paper proposes the use of empirical modeling techniques for building microarchitecture sensitiv...
Performance prediction models at the source code level are crucial components in advanced optimizing...
Achieving good performance on high-end computing systems is growing ever more challenging due to eno...
Cavazos, JohnThe number of optimizations that are available in modern day compilers are in their hun...
Tuning the performance of applications requires understanding the interactions between code and targ...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
This thesis presents a generic approach towards compiling fast execution-driven simulators, and appl...
Developing programs that fully utilize the available computing capabilities of the underlying hardwa...
Abstract. Performance understanding and prediction are extremely im-portant goals for guiding the ap...
Cavazos, JohnIt has been shown that machine-learning driven optimizations often outperform bundled o...
Projecting performance of applications and hardware is important to several market segments—hardware...
Tuning the performance of applications requires understanding the interactions between code and targ...
Developing an optimizing compiler for a newly proposed architecture is extremely difficult when ther...
Applying the right compiler optimizations to a particular program can have a significant impact on p...
To analyze the performance of applications and architectures, both programmers and architects desire...
This paper proposes the use of empirical modeling techniques for building microarchitecture sensitiv...
Performance prediction models at the source code level are crucial components in advanced optimizing...
Achieving good performance on high-end computing systems is growing ever more challenging due to eno...
Cavazos, JohnThe number of optimizations that are available in modern day compilers are in their hun...
Tuning the performance of applications requires understanding the interactions between code and targ...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
This thesis presents a generic approach towards compiling fast execution-driven simulators, and appl...
Developing programs that fully utilize the available computing capabilities of the underlying hardwa...