Interest in Arm-based platforms as HPC solutions has increased significantly over the last 5 years. In this paper we show that, in contrast to the early days of pioneering tests, several application performance analysis techniques can now also be applied to Arm-based SoCs. To show the possibilities offered by the available tools, we provide, as an example, the analysis of a Lattice Boltzmann HPC production code, highly optimized for several architectures and now also ported to Armv8. We tested it on a system based on production silicon, the Cavium CN8890 SoC. In particular, as performance analysis tools we adopt Extrae and Paraver, making use of the PAPI support initially developed by us for the ThunderX platform and now available also u...
The complexity of modern High-Performance-Computing systems imposes great challenges on running paral...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
Performance measurement and analysis of parallel applications is often challenging, despite many exc...
HPC systems and parallel applications are increasing in complexity. Therefore, the possibility of ...
In this paper, we analyze the performance and energy consumption of an Arm-based high-performance co...
Performance analysis tools allow application developers to identify and characterize the inefficienc...
In recent years, the energy efficiency of HPC systems has increasingly become of paramount import...
Simulations on HPC systems have become an indispensable key technology in modern science and enginee...
The usage of modern profiling and tracing tools is vital for understanding program behaviour, perfor...
Marvell’s ThunderX2 was the first Arm-based processor with deployments in large-scale HPC produ...
Many existing applications suffer from inherent scalability limitations that will prevent them from ...