Funding: This work has been supported by the European Union grant EU H2020-ICT-2014-1 project RePhrase (No. 644235). It has also been partially supported by the Austrian Ministry for Transport, Innovation and Technology, the Federal Ministry of Science, Research and Economy, and the Province of Upper Austria in the frame of the COMET center SCCH.This paper introduces a learning-based framework for dynamic placement of threads of parallel applications to the cores of Non-Uniform Memory Access (NUMA) architectures. Adaptation takes place in two levels, where at the first level each thread independently decides on which group of cores (NUMA node) it will execute, and on the second level it decides to which particular core from the group it wil...
In a non-uniform memory access machine, the placement of software threads to hardware cores can have...
International audienceDynamic task-parallel programming models are popular on shared-memory systems,...
International audienceNowadays shared memory HPC platforms expose a large number of cores organized ...
This paper introduces a learning-based framework for dynamic placement of threads of parallel applic...
This paper introduces a learning-based framework for dynamic placement of threads of parallel applic...
This paper introduces a reinforcement-learning based resource allocation framework for dynamic place...
Funding: This work has been partially supported by the European Union grant EU H2020-ICT-2014-1 proj...
This paper introduces a resource allocation framework specifically tailored for addressing the probl...
This paper describes a dynamic framework for mapping the threads of parallel applications to the com...
Modern day hardware platforms are parallel and diverse, ranging from mobiles to data centers. Mains...
International audienceNowadays, NUMA architectures are common in compute-intensive systems. Achievin...
International audienceExploiting the full computational power of current hierarchical multiprocessor...
International audienceApproaching the theoretical performance of hierarchical multicore machines req...
Within the last decade, microprocessor development reached a point at which higher clock rates and m...
Since multicore systems offer greater performance via parallelism, future computing is progressing t...
In a non-uniform memory access machine, the placement of software threads to hardware cores can have...
International audienceDynamic task-parallel programming models are popular on shared-memory systems,...
International audienceNowadays shared memory HPC platforms expose a large number of cores organized ...
This paper introduces a learning-based framework for dynamic placement of threads of parallel applic...
This paper introduces a learning-based framework for dynamic placement of threads of parallel applic...
This paper introduces a reinforcement-learning based resource allocation framework for dynamic place...
Funding: This work has been partially supported by the European Union grant EU H2020-ICT-2014-1 proj...
This paper introduces a resource allocation framework specifically tailored for addressing the probl...
This paper describes a dynamic framework for mapping the threads of parallel applications to the com...
Modern day hardware platforms are parallel and diverse, ranging from mobiles to data centers. Mains...
International audienceNowadays, NUMA architectures are common in compute-intensive systems. Achievin...
International audienceExploiting the full computational power of current hierarchical multiprocessor...
International audienceApproaching the theoretical performance of hierarchical multicore machines req...
Within the last decade, microprocessor development reached a point at which higher clock rates and m...
Since multicore systems offer greater performance via parallelism, future computing is progressing t...
In a non-uniform memory access machine, the placement of software threads to hardware cores can have...
International audienceDynamic task-parallel programming models are popular on shared-memory systems,...
International audienceNowadays shared memory HPC platforms expose a large number of cores organized ...