Multi-core nodes with Non-Uniform Memory Access (NUMA) are now a common architecture for high performance computing. On such NUMA nodes, the shared memory is physically distributed into memory banks connected by a network. Owing to this, memory access costs may vary depending on the distance between the processing unit and the memory bank. Therefore, a key element in improving the performance on these machines is dealing with memory affinity. We propose a NUMA-aware load balancer that combines the information about the NUMA topology with the statistics captured by the Charm++ runtime system. We present speedups of up to 1.8 for synthetic benchmarks running on different NUMA platforms. We also show improvements over existing load balancing s...
Abstract—An important aspect of workload characterization is understanding memory system performance...
As the adoption of Big Data technologies becomes the norm in an increasing number of scenarios, ther...
International audienceNowadays, on Multi-core Multiprocessors with Hierarchical Memory (Non-Uniform ...
Multi-core nodes with Non-Uniform Memory Access (NUMA) are now a common architecture for high perfor...
International audienceMulti-core compute nodes with non-uniform memory access (NUMA) are now a commo...
Abstract—Multi-core compute nodes with non-uniform mem-ory access (NUMA) are now a common architectu...
Multi-core platforms with non-uniform memory access (NUMA) design are now a common resource in High ...
The latency of memory access times is hence non-uniform, because it depends on where the request ori...
Modern multicore systems are based on a Non-Uniform Memory Access (NUMA) design. In a NUMA system, c...
This paper introduces two novel algorithms for thread migrations, named CIMAR (Core-aware Interchang...
Processors with multiple sockets or chiplets are becoming more conventional. These kinds of processo...
peer reviewedDuring the parallel execution of queries in Non-Uniform Memory Access (NUMA) sys...
Non-uniform memory access (NUMA) architectures are modern shared-memory, multi-core machines offerin...
Embedded manycore architectures are often organized as fabrics of tightly-coupled shared memory clus...
As the adoption of Big Data technologies becomes the norm in an increasing number of scenarios, ther...
Abstract—An important aspect of workload characterization is understanding memory system performance...
As the adoption of Big Data technologies becomes the norm in an increasing number of scenarios, ther...
International audienceNowadays, on Multi-core Multiprocessors with Hierarchical Memory (Non-Uniform ...
Multi-core nodes with Non-Uniform Memory Access (NUMA) are now a common architecture for high perfor...
International audienceMulti-core compute nodes with non-uniform memory access (NUMA) are now a commo...
Abstract—Multi-core compute nodes with non-uniform mem-ory access (NUMA) are now a common architectu...
Multi-core platforms with non-uniform memory access (NUMA) design are now a common resource in High ...
The latency of memory access times is hence non-uniform, because it depends on where the request ori...
Modern multicore systems are based on a Non-Uniform Memory Access (NUMA) design. In a NUMA system, c...
This paper introduces two novel algorithms for thread migrations, named CIMAR (Core-aware Interchang...
Processors with multiple sockets or chiplets are becoming more conventional. These kinds of processo...
peer reviewedDuring the parallel execution of queries in Non-Uniform Memory Access (NUMA) sys...
Non-uniform memory access (NUMA) architectures are modern shared-memory, multi-core machines offerin...
Embedded manycore architectures are often organized as fabrics of tightly-coupled shared memory clus...
As the adoption of Big Data technologies becomes the norm in an increasing number of scenarios, ther...
Abstract—An important aspect of workload characterization is understanding memory system performance...
As the adoption of Big Data technologies becomes the norm in an increasing number of scenarios, ther...
International audienceNowadays, on Multi-core Multiprocessors with Hierarchical Memory (Non-Uniform ...