International audienceExploiting the power of HPC platforms requires knowledge of their increasingly complex hardware topologies. Multiple components of the software stack, for instance MPI implementations or OpenMP runtimes, now perform their own topology discovery to find out the available cores and memory, and to better place tasks based on their affinities.We study in this article the impact of this topology discovery in terms of memory footprint. Storing locality information wastes an amount of physical memory that is becoming an issue on many-core platforms on the road to exascale.We demonstrate that this information may be factorized between processes by using a shared-memory region. Our analysis of the physical and virtual memories ...
Abstract—Memory size has long limited large-scale appli-cations on high-performance computing (HPC) ...
International audienceThe MPI standard is a major contribution in the landscape of parallel programm...
Several benchmarks for measuring memory performance of HPC systems along dimensions of spatial and t...
International audienceThe increasing complexity of parallel computing platforms requires a deep know...
International audienceHigh-performance computing requires a deep knowledge of the hardware platform ...
International audienceModern computing platforms are increasingly complex, with multiple cores, shar...
Parallel computing platforms are increasingly complex, with multiple cores, shared caches, and NUMA ...
International audienceThe cost of data movement has always been an important concern in high perform...
Abstract—Modern computing platforms are increasingly com-plex, with multiple cores, shared caches, a...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
Several benchmarks for measuring memory performance of HPC systems along dimensions of spatial and t...
Abstract—Memory size has long limited large-scale appli-cations on high-performance computing (HPC) ...
Memory size has long limited large-scale applications on high-performance computing (HPC) systems. S...
Abstract—Memory size has long limited large-scale appli-cations on high-performance computing (HPC) ...
International audienceThe MPI standard is a major contribution in the landscape of parallel programm...
Several benchmarks for measuring memory performance of HPC systems along dimensions of spatial and t...
International audienceThe increasing complexity of parallel computing platforms requires a deep know...
International audienceHigh-performance computing requires a deep knowledge of the hardware platform ...
International audienceModern computing platforms are increasingly complex, with multiple cores, shar...
Parallel computing platforms are increasingly complex, with multiple cores, shared caches, and NUMA ...
International audienceThe cost of data movement has always been an important concern in high perform...
Abstract—Modern computing platforms are increasingly com-plex, with multiple cores, shared caches, a...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
Several benchmarks for measuring memory performance of HPC systems along dimensions of spatial and t...
Abstract—Memory size has long limited large-scale appli-cations on high-performance computing (HPC) ...
Memory size has long limited large-scale applications on high-performance computing (HPC) systems. S...
Abstract—Memory size has long limited large-scale appli-cations on high-performance computing (HPC) ...
International audienceThe MPI standard is a major contribution in the landscape of parallel programm...
Several benchmarks for measuring memory performance of HPC systems along dimensions of spatial and t...