Modern supercomputers have complex features: many hardware threads, deep memory hierarchies, and many co-processors/accelerators. Productively and effectively designing programs to utilize those hardware features is crucial in gaining the best performance. There are several highly parallel programming models in active development that allow programmers to write efficient code on those architectures. Performance profiling is a very important technique in the development to achieve the best performance. In this dissertation, I proposed a new performance measurement and mapping technique that can associate performance data with program variables instead of code blocks. To validate the applicability of my data-centric profiling idea, I designe...
High Performance Computing (HPC) has always been a key foundation for scientific simulation and disc...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...
Data-intensive programs deal with big chunks of data and often contain compute-intensive characteris...
The parallel programming come a long way with the advances in the HPC. The high performance computin...
The rise of chip multiprocessing or the integration of multiple general purpose processing cores on ...
During the past decades, High-Performance Computing (HPC) has been widely used in various industries...
It is desirable for general productivity that high-performance computing applications be portable to...
thesisTo address the need of understanding and optimizing the performance of complex applications an...
Thesis (Ph.D.)--University of Washington, 2016-08Applications in data science rely on two computing ...
Traditional methods of performance analysis offer a code centric view, presenting performance data i...
The software performance optimizations process is one of the most challenging aspects of developing ...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scien...
Graphics Processing Units (GPUs) have become a key technology for accelerating node performance in s...
Hyperscale Data Centers (HDCs) are the largest distributed computing machines ever constructed. They...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
High Performance Computing (HPC) has always been a key foundation for scientific simulation and disc...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...
Data-intensive programs deal with big chunks of data and often contain compute-intensive characteris...
The parallel programming come a long way with the advances in the HPC. The high performance computin...
The rise of chip multiprocessing or the integration of multiple general purpose processing cores on ...
During the past decades, High-Performance Computing (HPC) has been widely used in various industries...
It is desirable for general productivity that high-performance computing applications be portable to...
thesisTo address the need of understanding and optimizing the performance of complex applications an...
Thesis (Ph.D.)--University of Washington, 2016-08Applications in data science rely on two computing ...
Traditional methods of performance analysis offer a code centric view, presenting performance data i...
The software performance optimizations process is one of the most challenging aspects of developing ...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scien...
Graphics Processing Units (GPUs) have become a key technology for accelerating node performance in s...
Hyperscale Data Centers (HDCs) are the largest distributed computing machines ever constructed. They...
Performance analysis of parallel programs continues to be challenging for programmers. Programmers h...
High Performance Computing (HPC) has always been a key foundation for scientific simulation and disc...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...
Data-intensive programs deal with big chunks of data and often contain compute-intensive characteris...