There is a need to increase performance under the same power and area envelope to achieve Exascale technology in high performance computing (HPC). The today's chip multiprocessor (CMP) design is tailored by traditional desktop and server workloads, different from parallel applications commonly run in HPC. In this work, we focus on the HPC code characteristics and processor front-end which factors around 30% of core power and area on the emerging lean-core type of processors used in HPC. Separating serial from parallel code sections inside applications, we characterize three HPC benchmark suites and compare them to a traditional set of desktop integer workloads. HPC applications have biased and mostly backward taken branches, small dynamic i...
While parallel applications in all scientific and engineering domains have always been prone to exec...
The landscape of High Performance Computing (HPC) system architectures keeps expanding with new tech...
International audienceComputing systems with a large number of processing units are increasingly com...
There is a need to increase performance under the same power and area envelope to achieve Exascale t...
High performance computing (HPC) applications have parallel code sections that must scale to large n...
Nowadays, the whole HPC community is looking forward to the exascale era, with computer and system a...
Asymmetric multi-cores (AMCs) are a successful architectural solution for both mobile devices and su...
Performance measurement and analysis of parallel applications is often challenging, despite many exc...
An important aspect of High-Performance Computing (HPC) system design is the choice of main memory c...
The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-96983-1_10Des...
High Performance Computing (HPC) aims at providing reasonably fast computing solutions to scientific...
As high-performance computing (HPC) systems advance towards exascale (10^18 operations per second), ...
Computing technologies populating high-performance computing (HPC) clusters are getting more and mor...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
Many studies have shown that load imbalancing causes significant performance degradation in High Per...
While parallel applications in all scientific and engineering domains have always been prone to exec...
The landscape of High Performance Computing (HPC) system architectures keeps expanding with new tech...
International audienceComputing systems with a large number of processing units are increasingly com...
There is a need to increase performance under the same power and area envelope to achieve Exascale t...
High performance computing (HPC) applications have parallel code sections that must scale to large n...
Nowadays, the whole HPC community is looking forward to the exascale era, with computer and system a...
Asymmetric multi-cores (AMCs) are a successful architectural solution for both mobile devices and su...
Performance measurement and analysis of parallel applications is often challenging, despite many exc...
An important aspect of High-Performance Computing (HPC) system design is the choice of main memory c...
The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-96983-1_10Des...
High Performance Computing (HPC) aims at providing reasonably fast computing solutions to scientific...
As high-performance computing (HPC) systems advance towards exascale (10^18 operations per second), ...
Computing technologies populating high-performance computing (HPC) clusters are getting more and mor...
HPC applications are often very complex and their behavior depends on a wide range of factors from a...
Many studies have shown that load imbalancing causes significant performance degradation in High Per...
While parallel applications in all scientific and engineering domains have always been prone to exec...
The landscape of High Performance Computing (HPC) system architectures keeps expanding with new tech...
International audienceComputing systems with a large number of processing units are increasingly com...