As multicore processors implementing shared-memory programming models have become commonplace, analysis tools for shared-memory programs have become increasingly important. One common general-purpose analysis method is reuse distance, which measures locality in application memory reference behavior and is used to predict cache performance, drive compiler-based optimization, and support visualization and manual optimization of programs. This thesis presents and validates methods to extend reuse distance analysis of application locality characteristics to shared-memory multicore platforms by accounting for invalidation-based cache coherence and inter-core cache sharing. Existing reuse distance analysis methods track the number of distinct addre...
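As a rough illustration of the metric this abstract refers to, the sketch below computes single-threaded reuse distances under the usual definition: the number of distinct addresses touched between two consecutive accesses to the same address. The function name `reuse_distances` and the list-of-addresses trace format are assumptions made for illustration, not taken from the thesis, and the sketch does not model the coherence or cache-sharing extensions the abstract describes.

```python
from collections import OrderedDict


def reuse_distances(trace):
    """Return one reuse distance per access in `trace`.

    A distance is the number of distinct addresses accessed since the
    previous access to the same address, or None for a first access.
    (Hypothetical helper; simple O(N * M) LRU-stack walk.)
    """
    stack = OrderedDict()  # keys ordered from least to most recently used
    distances = []
    for addr in trace:
        if addr in stack:
            keys = list(stack)
            # Everything after addr in LRU order was touched since its last use.
            distances.append(len(keys) - 1 - keys.index(addr))
            stack.move_to_end(addr)  # addr becomes the most recently used
        else:
            distances.append(None)  # cold (first-time) access
            stack[addr] = True
    return distances


if __name__ == "__main__":
    print(reuse_distances(list("abcabb")))
```

Under this definition, an access hits in a fully associative LRU cache of capacity C exactly when its reuse distance is less than C, which is why a histogram of these distances can be used to estimate miss rates across cache sizes.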
The potential for improving the performance of data-intensive scientific programs by enhancing data ...
Abstract. This paper proposes a methodology to study the data reuse quality of task-parallel runtime...
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 2014. As multi-core processors b...
This paper presents and validates methods to extend reuse distance analysis of application locality ...
Cache is one of the most widely used components in today's computing systems. Its performance is hea...
The performance and energy efficiency of multicore systems are increasingly dominated by the costs o...
As computing efficiency becomes constrained by hardware scaling limitations, code optimization grows...
Understanding multicore memory behavior is crucial, but can be challenging due to the cache hierarc...
Understanding multicore memory behavior is crucial, but can be challenging due to the complex cache ...
Feedback-directed optimization has become an increasingly important tool in designing and building o...
Profiling can effectively analyze program behavior and provide critical information for feedback-dir...
Feedback-directed optimization has become an increasingly important tool in designing and building ...
Abstract. Profiling can effectively analyze program behavior and provide critical information for fe...
Emerging computer architectures will feature drastically decreased flops/byte (ratio of peak process...
We propose COMDETECTIVE+, an inter-thread communication analyzer, and REUSETRACKER+, a reuse distanc...