Conclusions of this presentation are: (1) Open SpeedShop's (OSS) is convenient to use for large, parallel, scientific simulation codes; (2) Large codes benefit from uninstrumented execution; (3) Many experiments can be run in a short time - might need multiple shots e.g. usertime for caller-callee, hwcsamp for HW counters; (4) Decent idea of code's performance is easily obtained; (5) Statistical sampling calls for decent number of samples; and (6) HWC data is very useful for micro-analysis but can be tricky to analyze
International audienceComputing hardware, from mobile devices to supercomputer clusters, is undergoi...
Scalasca is a software tool that supports the performance optimization of parallel programs by measu...
Most experimental studies of the performance of parallel simulation protocols use speedup or number ...
There are a number of challenges facing the High Performance Computing (HPC) community, including in...
Fast computer simulation is an essential tool in the design of large parallel computers. Our Fast Ac...
Abstract—Increasing number of cores in parallel computer systems are allowing scientific simulations...
In this article the runtime behavior of the simulation software Fastest is investigated and its perf...
Performance prediction is a useful thing to do to help parallel programmers answer questions such as...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scie...
Simulations on HPC systems have become an indispensable key technology in modern science and enginee...
This paper introduces the EC frontend and DSIM simulator. Given a parallel program, they determine i...
This paper examines the cost/performance of simulating a hypothetical target parallel computer using...
One method to evaluate a distributed shared memory(DSM) system is to analyze its performance for a v...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scien...
International audienceComputing hardware, from mobile devices to supercomputer clusters, is undergoi...
Scalasca is a software tool that supports the performance optimization of parallel programs by measu...
Most experimental studies of the performance of parallel simulation protocols use speedup or number ...
There are a number of challenges facing the High Performance Computing (HPC) community, including in...
Fast computer simulation is an essential tool in the design of large parallel computers. Our Fast Ac...
Abstract—Increasing number of cores in parallel computer systems are allowing scientific simulations...
In this article the runtime behavior of the simulation software Fastest is investigated and its perf...
Performance prediction is a useful thing to do to help parallel programmers answer questions such as...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scie...
Simulations on HPC systems have become an indispensable key technology in modern science and enginee...
This paper introduces the EC frontend and DSIM simulator. Given a parallel program, they determine i...
This paper examines the cost/performance of simulating a hypothetical target parallel computer using...
One method to evaluate a distributed shared memory(DSM) system is to analyze its performance for a v...
Performance analysis tools are essential to the maintenance of efficient parallel execution of scien...
International audienceComputing hardware, from mobile devices to supercomputer clusters, is undergoi...
Scalasca is a software tool that supports the performance optimization of parallel programs by measu...
Most experimental studies of the performance of parallel simulation protocols use speedup or number ...