Performance analysis is a critical aspect of CPU design, but it has become more difficult during the past decade as physical constraints limit improvements in single-threaded performance. This dissertation analyzes three interrelated problems associated with effective performance analysis. First, high-level microarchitecture simulation is orders of magnitude slower than native execution. I propose a novel statistical sampling technique called LiveSim that dramatically reduces simulation time. Second, multithreaded benchmarks may use input sets that produce misleading results. I demonstrate, for the first time, the true scalability of the PARSEC benchmark suite using real multiprocessor systems, and show how to accurately evaluate the per...
Writing well-performing parallel programs is challenging in the multi-core processor era. In additio...
PhD ThesisCurrent microprocessors improve performance by exploiting instruction-level parallelism (I...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
Fast and accurate microprocessor simulation has long remained a challenge in the design and evaluati...
Fast and accurate microprocessor simulation has long remained a challenge in the design and evaluati...
Application performance on computer processors depends on a number of complex architectural and micr...
Application performance on computer processors depends on a number of complex architectural and micr...
To increase performance, modern processors employ complex techniques such as out-of-order pipelines ...
Performance comparisons are ubiquitous in computer science. The proceedings of most conferences are ...
Performance comparisons are ubiquitous in computer science. The proceedings of most conferences are ...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
DoctorProcessor microarchitectures have been evolving and getting sophisticated to meet increasing c...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
Tuning the performance of applications requires understanding the interactions between code and targ...
Current microprocessors improve performance by exploiting instruction-level parallelism (ILP). ILP h...
Writing well-performing parallel programs is challenging in the multi-core processor era. In additio...
PhD ThesisCurrent microprocessors improve performance by exploiting instruction-level parallelism (I...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
Fast and accurate microprocessor simulation has long remained a challenge in the design and evaluati...
Fast and accurate microprocessor simulation has long remained a challenge in the design and evaluati...
Application performance on computer processors depends on a number of complex architectural and micr...
Application performance on computer processors depends on a number of complex architectural and micr...
To increase performance, modern processors employ complex techniques such as out-of-order pipelines ...
Performance comparisons are ubiquitous in computer science. The proceedings of most conferences are ...
Performance comparisons are ubiquitous in computer science. The proceedings of most conferences are ...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
DoctorProcessor microarchitectures have been evolving and getting sophisticated to meet increasing c...
Systems for high performance computing are getting increasingly complex. On the one hand, the number...
Tuning the performance of applications requires understanding the interactions between code and targ...
Current microprocessors improve performance by exploiting instruction-level parallelism (ILP). ILP h...
Writing well-performing parallel programs is challenging in the multi-core processor era. In additio...
PhD ThesisCurrent microprocessors improve performance by exploiting instruction-level parallelism (I...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...