NUMA architectures are now common in compute-intensive systems. Achieving high performance for multi-threaded applications requires both careful placement of threads on computing units and careful allocation of data in memory. Finding such a placement is hard, because performance depends on complex interactions across several layers of the memory hierarchy. In this paper we propose a black-box approach to decide whether an application's execution time can be affected by the placement of its threads and data and, if so, to choose the best placement strategy to adopt. We show that it is possible to reach near-optimal placement policy selection. Furthermore, solutions work across several recent ...
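To make the two decisions in the abstract concrete (is the application sensitive to placement, and if so which policy is best), here is a minimal Python sketch. It is not the authors' method: it assumes the application has already been timed several times under each candidate policy, and the policy names, the `select_placement` function, and the 5% threshold are all hypothetical choices made for illustration.

```python
from statistics import median


def select_placement(timings, threshold=0.05):
    """Pick a placement policy from measured run times.

    timings:   dict mapping a policy name to a list of run times in seconds
               (hypothetical measurements, one list per candidate policy).
    threshold: relative gap between the slowest and fastest policy above
               which the application is treated as placement-sensitive
               (an assumed cut-off, not a value from the paper).
    """
    # Use the median of each policy's runs to dampen measurement noise.
    medians = {policy: median(runs) for policy, runs in timings.items()}
    best = min(medians, key=medians.get)
    worst = max(medians, key=medians.get)
    # Relative slowdown of the worst policy compared with the best one.
    spread = (medians[worst] - medians[best]) / medians[best]
    sensitive = spread > threshold
    return sensitive, best, spread


if __name__ == "__main__":
    # Hypothetical timings for three illustrative policies.
    measured = {
        "spread": [10.0, 10.2, 9.8],
        "compact": [12.0, 12.1, 11.9],
        "interleave": [10.5, 10.4, 10.6],
    }
    print(select_placement(measured))
```

Treating the application purely through its measured run times is what makes this sketch black-box: nothing about its memory access pattern needs to be known, at the cost of one or more runs per candidate policy.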
Our work addresses the problem of placement of threads, or virtual cores, onto physical cores in a m...
The evolution of microprocessors has led to a situation where more memory is integrated closer to ...
Parallel scientific programs executing in a NUMA environment are confronted with the problem of how ...
Multicore multiprocessors use a Non Uniform Memory Architecture (NUMA) to improve their scalability....
Multicore multiprocessors use Non Uniform Memory Architecture (NUMA) to improve their scalability. ...
The problem of placement of threads, or virtual cores, on physical cores in a multicore system has b...
This paper introduces a learning-based framework for dynamic placement of threads of parallel applic...
In a non-uniform memory access machine, the placement of software threads to hardware cores can have...
It is well known that the placement of threads and memory plays a crucial role for performance on NU...
This paper introduces a reinforcement-learning based resource allocation framework for dynamic place...
Current multi-socket systems have complex memory hierarchies with significant Non-Uniform Memory Acc...
Non Uniform Memory Access (NUMA) architectures are nowadays common for running...
The introduction of multicore/multithreaded processors, comprised of a large number of hardware cont...