In this dissertation we approach the study of Precise Event-Based Sampling (PEBS) techniques to improve the performance of applications on a NUMA, Itanium2-based system. We demonstrate that a low-cost, PEBS profiling can support strategies to improve the performance of an important group of computational and scientific codes in runtime. In addition, the accurate information provided by the new Event Adress Registers (EAR) of the Intel Itanium architecture helps foster the development of new data allocation strategies. Following this line, we have also developed a series of dynamic page migration PEBS strategies. Specifically, two problems are addressed: how to improve the performance of locality optimisation techniques for irregular codes i...
Commercial link : http://www.springerlink.de/ ALCHEMY/http://www.springer.comCache memories were inv...
Enhancing the match between software executions and hardware features is key to computing efficiency...
The objective of this research is to improve the performance of sparse problems that have a wide ran...
In this thesis, we propose and evaluate several techniques to dynamically increase the memory access...
With the slowing or even death of Moore’s Law, computer system architectures are trending toward mor...
Many data-intensive applications exhibit poor temporal and spatial locality and perform poorly on co...
An increasing prevalence of data-irregularity is being seen in applications today, particularly in m...
Software prefetching and locality optimizations are two techniques for overcoming the speed gap bet...
This thesis presents a systematic study of two modes of program execution: synchronous and asynchron...
This dissertation maps various kernels and applications to a spectrum of programming models and arch...
The growing gap between processor and memory speeds results in complex memory hierarchies as process...
This paper introduces two novel algorithms for thread migrations, named CIMAR (Core-aware Interchang...
Computer simulation has become increasingly important in many scientiï¬c disciplines, but its perfor...
An important class of scientific codes access memory in an irregular manner. Because irregular acce...
The speed of processors increases much faster than the memory access time. This makes memory accesse...
Commercial link : http://www.springerlink.de/ ALCHEMY/http://www.springer.comCache memories were inv...
Enhancing the match between software executions and hardware features is key to computing efficiency...
The objective of this research is to improve the performance of sparse problems that have a wide ran...
In this thesis, we propose and evaluate several techniques to dynamically increase the memory access...
With the slowing or even death of Moore’s Law, computer system architectures are trending toward mor...
Many data-intensive applications exhibit poor temporal and spatial locality and perform poorly on co...
An increasing prevalence of data-irregularity is being seen in applications today, particularly in m...
Software prefetching and locality optimizations are two techniques for overcoming the speed gap bet...
This thesis presents a systematic study of two modes of program execution: synchronous and asynchron...
This dissertation maps various kernels and applications to a spectrum of programming models and arch...
The growing gap between processor and memory speeds results in complex memory hierarchies as process...
This paper introduces two novel algorithms for thread migrations, named CIMAR (Core-aware Interchang...
Computer simulation has become increasingly important in many scientiï¬c disciplines, but its perfor...
An important class of scientific codes access memory in an irregular manner. Because irregular acce...
The speed of processors increases much faster than the memory access time. This makes memory accesse...
Commercial link : http://www.springerlink.de/ ALCHEMY/http://www.springer.comCache memories were inv...
Enhancing the match between software executions and hardware features is key to computing efficiency...
The objective of this research is to improve the performance of sparse problems that have a wide ran...