The interest towards Arm based platforms as HPC solutions increased significantly during the last 5 years. In this paper we show that, in contrast to the early days of pioneer tests, several application performance analysis techniques can now be applied also to Arm based SoCs. To show the possibilities offered by the available tools, we provide as an example, the analysis of a Lattice Boltzmann HPC production code, highly optimized for several architectures and now ported also to Armv8. We tested it on a system based on a production silicon, Cavium CN8890 SoC. In particular, as performance analysis tools we adopt Extrae and Paraver, making use of the PAPI support, initially developed by us for the ThunderX platform, and now available also u...
Simulations on HPC systems have become an indispensable key technology in modern science and enginee...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
One of the emerging architectures in HPC systems is Intel’s Knights Landing (KNL) many core chip, wh...
The interest towards Arm based platforms as HPC solutions increased significantly during the last 5 ...
The interest towards Arm based platforms as HPC solutions increased significantly during the last 5 ...
HPC systems and parallel applications are increasing their complexity. Therefore the possibility of ...
In this paper, we analyze the performance and energy consumption of an Arm-based high-performance co...
Performance analysis tools allow application developers to identify and characterize the inefficienc...
Performance analysis tools allow application developers to identify and characterize the inefficienc...
In the last years, the energy efficiency of HPC systems is increasingly becoming of paramount import...
Performance analysis tools allow application developers to identify and characterize the inefficienc...
Marvell’s ThunderX2 has been the first Arm-based processor with deployments in large-scale HPC produ...
While parallel applications in all scientific and engineering domains have always been prone to exec...
Many existing applications suffer from inherent scalability limitations that will prevent them from ...
The High-Performance Conjugate Gradient (HPCG) benchmark complements the LINPACK benchmark in the pe...
Simulations on HPC systems have become an indispensable key technology in modern science and enginee...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
One of the emerging architectures in HPC systems is Intel’s Knights Landing (KNL) many core chip, wh...
The interest towards Arm based platforms as HPC solutions increased significantly during the last 5 ...
The interest towards Arm based platforms as HPC solutions increased significantly during the last 5 ...
HPC systems and parallel applications are increasing their complexity. Therefore the possibility of ...
In this paper, we analyze the performance and energy consumption of an Arm-based high-performance co...
Performance analysis tools allow application developers to identify and characterize the inefficienc...
Performance analysis tools allow application developers to identify and characterize the inefficienc...
In the last years, the energy efficiency of HPC systems is increasingly becoming of paramount import...
Performance analysis tools allow application developers to identify and characterize the inefficienc...
Marvell’s ThunderX2 has been the first Arm-based processor with deployments in large-scale HPC produ...
While parallel applications in all scientific and engineering domains have always been prone to exec...
Many existing applications suffer from inherent scalability limitations that will prevent them from ...
The High-Performance Conjugate Gradient (HPCG) benchmark complements the LINPACK benchmark in the pe...
Simulations on HPC systems have become an indispensable key technology in modern science and enginee...
This tutorial presents state-of-the-art performance tools for leading-edge HPC systems founded on th...
One of the emerging architectures in HPC systems is Intel’s Knights Landing (KNL) many core chip, wh...