Parallel computing platforms are increasingly complex, with multiple cores, shared caches, and NUMA memory interconnects, as well as asymmetric I/O access. Upcoming architectures will add a heterogeneous memory subsystem with non-volatile and/or high-bandwidth memory banks. Parallel applications developers have to take locality into account before they can expect good efficiency on these platforms. Thus there is a strong need for a portable tool gathering and exposing this information. The Hardware Locality project (hwloc) offers a tree representation of the hardware based on the inclusion of CPU resources and localities of memory and I/O devices. It is already widely used for affinity-based task placement in high performance computing. We ...
Recently, multi-cores chips have become omnipresent in computer systems ranging from high-end server...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
International audienceThe task-based approach has emerged as a viable way to effectively use modern ...
International audienceModern computing platforms are increasingly complex, with multiple cores, shar...
Abstract—Modern computing platforms are increasingly com-plex, with multiple cores, shared caches, a...
International audienceHigh-performance computing requires a deep knowledge of the hardware platform ...
International audienceThe increasing complexity of parallel computing platforms requires a deep know...
International audienceExploiting the power of HPC platforms requires knowledge of their increasingly...
International audienceNowadays, performance optimization involves careful data and task placement to...
International audienceThe MPI standard is a major contribution in the landscape of parallel programm...
International audienceThe cost of data movement has always been an important concern in high perform...
International audienceThe ordered read-write lock model (ORWL) is a modern framework that proposes h...
International audienceThe task-based approach has emerged as a viable way to effectively use modern ...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
Recently, multi-cores chips have become omnipresent in computer systems ranging from high-end server...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
International audienceThe task-based approach has emerged as a viable way to effectively use modern ...
International audienceModern computing platforms are increasingly complex, with multiple cores, shar...
Abstract—Modern computing platforms are increasingly com-plex, with multiple cores, shared caches, a...
International audienceHigh-performance computing requires a deep knowledge of the hardware platform ...
International audienceThe increasing complexity of parallel computing platforms requires a deep know...
International audienceExploiting the power of HPC platforms requires knowledge of their increasingly...
International audienceNowadays, performance optimization involves careful data and task placement to...
International audienceThe MPI standard is a major contribution in the landscape of parallel programm...
International audienceThe cost of data movement has always been an important concern in high perform...
International audienceThe ordered read-write lock model (ORWL) is a modern framework that proposes h...
International audienceThe task-based approach has emerged as a viable way to effectively use modern ...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
Recently, multi-cores chips have become omnipresent in computer systems ranging from high-end server...
The cost of data movement has always been an important concern in high performance computing (HPC) s...
International audienceThe task-based approach has emerged as a viable way to effectively use modern ...