This paper introduces a learning-based framework for dynamic placement of threads of parallel applications to the cores of Non-Uniform Memory Access (NUMA) architectures. Adaptation takes place in two levels, where at the first level each thread independently decides on which group of cores (NUMA node) it will execute, and on the second level it decides to which particular core from the group it will be pinned. Naturally, these two adaptation levels run on different time-scales: a low-frequency switching for the NUMA-node adaptation, and a high-frequency switching for the CPU-node level adaptation. In addition, the learning dynamics have been designed to handle measurement noise and rapid variations in the performance of the threads. The ad...
Since multicore systems offer greater performance via parallelism, future computing is progressing t...
Multicore multiprocessors use Non Uniform Memory Ar-chitecture (NUMA) to improve their scalability. ...
International audienceExploiting the full computational power of current hierarchical multiprocessor...
This paper introduces a learning-based framework for dynamic placement of threads of parallel applic...
This paper introduces a reinforcement-learning based resource allocation framework for dynamic place...
Funding: This work has been supported by the European Union grant EU H2020-ICT-2014-1 project RePhra...
This paper describes a dynamic framework for mapping the threads of parallel applications to the com...
This paper introduces a resource allocation framework specifically tailored for addressing the probl...
This paper describes a dynamic framework for mapping the threads of parallel applications to the com...
International audienceNowadays, NUMA architectures are common in compute-intensive systems. Achievin...
In a non-uniform memory access machine, the placement of software threads to hardware cores can have...
Multicore multiprocessors use a Non Uniform Memory Architecture (NUMA) to improve their scalability....
Modern day hardware platforms are parallel and diverse, ranging from mobiles to data centers. Mains...
Parallel scientific programs executing in a NUMA environment are confronted with the problem of how ...
The problem of placement of threads, or virtual cores, on physical cores in a multicore system has b...
Since multicore systems offer greater performance via parallelism, future computing is progressing t...
Multicore multiprocessors use Non Uniform Memory Ar-chitecture (NUMA) to improve their scalability. ...
International audienceExploiting the full computational power of current hierarchical multiprocessor...
This paper introduces a learning-based framework for dynamic placement of threads of parallel applic...
This paper introduces a reinforcement-learning based resource allocation framework for dynamic place...
Funding: This work has been supported by the European Union grant EU H2020-ICT-2014-1 project RePhra...
This paper describes a dynamic framework for mapping the threads of parallel applications to the com...
This paper introduces a resource allocation framework specifically tailored for addressing the probl...
This paper describes a dynamic framework for mapping the threads of parallel applications to the com...
International audienceNowadays, NUMA architectures are common in compute-intensive systems. Achievin...
In a non-uniform memory access machine, the placement of software threads to hardware cores can have...
Multicore multiprocessors use a Non Uniform Memory Architecture (NUMA) to improve their scalability....
Modern day hardware platforms are parallel and diverse, ranging from mobiles to data centers. Mains...
Parallel scientific programs executing in a NUMA environment are confronted with the problem of how ...
The problem of placement of threads, or virtual cores, on physical cores in a multicore system has b...
Since multicore systems offer greater performance via parallelism, future computing is progressing t...
Multicore multiprocessors use Non Uniform Memory Ar-chitecture (NUMA) to improve their scalability. ...
International audienceExploiting the full computational power of current hierarchical multiprocessor...