In modern parallel architectures, memory accesses are a common bottleneck. Optimizing how applications access memory is therefore an important way to improve performance and energy consumption. Memory accesses matter even more on NUMA machines, where the access time to data depends on its location in memory. Many efforts have been made to develop adaptive tools that improve memory accesses at runtime by optimizing the mapping of data and threads to NUMA nodes. However, these tools cannot change the memory access pattern of the original application, so code written without considering memory performance might not benefit from them. Moreover, automatic mapping tools take time to conver...
Nowadays, on hierarchical shared memory multiprocessors with Non-Uniform Memory Access (NUMA), the n...
Shared memory systems are becoming increasingly complex as they typically integrate several storage ...
A multiprocessor system with uniform memory access is difficult to scale due to the increasing conte...
Non-Uniform Memory Access (NUMA) architectures are nowadays common for running...
Shared memory applications running transparently on top of NUMA architectures often face severe perf...
In modern parallel architectures, memory accesses represent a common bottleneck. Thus, optimizing the...
The available memory bandwidth of existing high performance computing platforms turns out as being m...
Multi-core nodes with Non-Uniform Memory Access (NUMA) are now a common architecture for high perfor...
Nowadays the evolution of High Performance Computing follows the needs of numerical simulations. Thes...
Multi-core platforms with non-uniform memory access (NUMA) design are now a common resource in High ...
The latency of memory access times is hence non-uniform, because it depends on where the request ori...
This paper introduces two novel algorithms for thread migrations, named CIMAR (Core-aware Interchang...
Current multi-socket systems have complex memory hierarchies with significant Non-Uniform Memory Acc...
Modern multicore systems are based on a Non-Uniform Memory Access (NUMA) design. In a NUMA system, c...
Dynamic task-parallel programming models are popular on shared-memory systems,...