Remote Direct Memory Access (RDMA) is expected to be an integral communication mechanism for future exascale systems - enabling asynchronous data transfers, so that applications may fully utilize CPU resources while simultaneously sharing data amongst remote nodes. In this work we examine Network-induced Memory Contention (NiMC) on Infiniband networks. We expose the interactions between RDMA, main-memory and cache, when applications and out-of-band services compete for memory resources. We then explore NiMC's resulting impact on application-level performance. For a range of hardware technologies and HPC workloads, we quantify NiMC and show that NiMC's impact grows with scale resulting in up to 3X performance degradation at scales as small a...
Remote Direct Memory Access (RDMA) fabrics such as Infiniband and Converged Ethernet report latencie...
It is well known that contention is one of the factors that limit the performance of high performanc...
International audienceTo amortize the cost of MPI communications, distributed parallel HPC applicati...
Remote Direct Memory Access (RDMA) is expected to be an integral communication mechanism for future ...
On multicore processors, co-executing applications compete for shared resources, such as cache capac...
: Many research results in recent years have focused on the design of distributed shared memory (DSM...
Remote Direct Memory Access (RDMA) is becoming widely available in data centers. This technology all...
International audienceMulti-core clusters are cost-effective clusters largely used in high-performan...
International audienceTo amortize the cost of MPI communications, distributed parallel HPC applicati...
CC-NUMA architectures have become extremely popular by providing fast and transparent access to data...
Disaggregated memory has recently been proposed as a way to allow flexible and fine-grained allocati...
International audienceOverlapping communications with computations in distributed applications shoul...
International audienceIn-memory storage systems emerged as a de-facto building block for today's lar...
Remote Direct Memory Access (RDMA) is a networking protocol that provides high bandwidth and low lat...
The compute requirements associated with the TCP/IP protocol suite have been previously studied by a...
Remote Direct Memory Access (RDMA) fabrics such as Infiniband and Converged Ethernet report latencie...
It is well known that contention is one of the factors that limit the performance of high performanc...
International audienceTo amortize the cost of MPI communications, distributed parallel HPC applicati...
Remote Direct Memory Access (RDMA) is expected to be an integral communication mechanism for future ...
On multicore processors, co-executing applications compete for shared resources, such as cache capac...
: Many research results in recent years have focused on the design of distributed shared memory (DSM...
Remote Direct Memory Access (RDMA) is becoming widely available in data centers. This technology all...
International audienceMulti-core clusters are cost-effective clusters largely used in high-performan...
International audienceTo amortize the cost of MPI communications, distributed parallel HPC applicati...
CC-NUMA architectures have become extremely popular by providing fast and transparent access to data...
Disaggregated memory has recently been proposed as a way to allow flexible and fine-grained allocati...
International audienceOverlapping communications with computations in distributed applications shoul...
International audienceIn-memory storage systems emerged as a de-facto building block for today's lar...
Remote Direct Memory Access (RDMA) is a networking protocol that provides high bandwidth and low lat...
The compute requirements associated with the TCP/IP protocol suite have been previously studied by a...
Remote Direct Memory Access (RDMA) fabrics such as Infiniband and Converged Ethernet report latencie...
It is well known that contention is one of the factors that limit the performance of high performanc...
International audienceTo amortize the cost of MPI communications, distributed parallel HPC applicati...