Multiprocessor memory reference traces provide a wealth of information on the behavior of parallel programs. We have used this information to explore the relationship between kernel-based NUMA management policies and multiprocessor memory architecture. Our trace analysis techniques employ an off-line, optimal cost policy as a baseline against which to compare on-line policies, and as a policyinsensitive tool for evaluating architectural design alternatives. We compare the performance of our optimal policy with that of three implementable policies (two of which appear in previous work), on a variety of applications, with varying relative speeds for page moves and local, global, and remote memory references. Our results indicate that a good N...
The invention, acceptance, and proliferation of multiprocessors are primarily a result of the quest ...
Large-scale shared-memory multiprocessors such as the BBN Butterfly and IBM RP3 introduce a new leve...
Non-Uniform Memory Access (NUMA) architectures make it possible to build large-scale shared memory m...
There are three major classes of MIMD multiprocessors: cache-coherent machines, NUMA (non-uniform me...
Abstract—An important aspect of workload characterization is understanding memory system performance...
Cache Coherent Non-Uniform Memory Access (CC-NUMA) architectures have received strong interests from...
PLATINUM is an operating system kernel with a novel memory management system for N on-Uniform Memory...
Non-uniform memory access (NUMA) architectures are modern shared-memory, multi-core machines offerin...
In scalable multiprocessor architectures, the times required for a processor to access various porti...
Nonuniform memory access time (referred to as NUMA) is an important feature in the design of large s...
Shared memory applications running transparently on top of NUMA architectures often face severe perf...
The latency of memory access times is hence non-uniform, because it depends on where the request ori...
The choice of a good data distribution scheme is critical to performance of data-parallel applicatio...
In recent years, much effort has been devoted to analyzing the performance of distributed memory sys...
International audienceNon Uniform Memory Access (NUMA) architectures are nowadays common for running...
The invention, acceptance, and proliferation of multiprocessors are primarily a result of the quest ...
Large-scale shared-memory multiprocessors such as the BBN Butterfly and IBM RP3 introduce a new leve...
Non-Uniform Memory Access (NUMA) architectures make it possible to build large-scale shared memory m...
There are three major classes of MIMD multiprocessors: cache-coherent machines, NUMA (non-uniform me...
Abstract—An important aspect of workload characterization is understanding memory system performance...
Cache Coherent Non-Uniform Memory Access (CC-NUMA) architectures have received strong interests from...
PLATINUM is an operating system kernel with a novel memory management system for N on-Uniform Memory...
Non-uniform memory access (NUMA) architectures are modern shared-memory, multi-core machines offerin...
In scalable multiprocessor architectures, the times required for a processor to access various porti...
Nonuniform memory access time (referred to as NUMA) is an important feature in the design of large s...
Shared memory applications running transparently on top of NUMA architectures often face severe perf...
The latency of memory access times is hence non-uniform, because it depends on where the request ori...
The choice of a good data distribution scheme is critical to performance of data-parallel applicatio...
In recent years, much effort has been devoted to analyzing the performance of distributed memory sys...
International audienceNon Uniform Memory Access (NUMA) architectures are nowadays common for running...
The invention, acceptance, and proliferation of multiprocessors are primarily a result of the quest ...
Large-scale shared-memory multiprocessors such as the BBN Butterfly and IBM RP3 introduce a new leve...
Non-Uniform Memory Access (NUMA) architectures make it possible to build large-scale shared memory m...