Remote Direct Memory Access (RDMA) is becoming widely available in data centers. This technology allows a process to directly read and write the memory of a remote host, with a mechanism to control access permissions. In this paper, we study the fundamental power of these capabilities. We consider the well-known problem of achieving consensus despite failures, and find that RDMA can improve the inherent trade-off in distributed computing between failure resilience and performance. Specifically, we show that RDMA allows algorithms that simultaneously achieve high resilience and high performance, while traditional algorithms had to choose one or the other. With Byzantine failures, we give an algorithm that only requires n \geq 2f_P + 1 processe...
Remote Direct Memory Access (RDMA) fabrics such as InfiniBand and Converged Ethernet report latencie...
Distributed memory systems are becoming increasingly important since they provide a system-scale abs...
The emergence of commercially available network interface controllers (NICs) with remote...
The increasing amount of data that needs to be collected and analyzed requires large-scale datacente...
Remote Direct Memory Access (RDMA) is a networking protocol that provides high bandwidth and low lat...
Byzantine fault tolerance (BFT) protocols can mitigate attacks and errors and are increasingly inve...
Distributed data structures are key to implementing scalable applications for scientific simulations...
We will cover distributed memory programming of high-performance supercomputers and datacenter compu...
It is becoming increasingly popular for distributed systems to exploit offload to reduce load on the...
Highly available database systems rely on data replication t...
Remote Direct Memory Access (RDMA) is widely used in High-Performance Computing (HPC) while making i...
Distributed systems are commonly built under the assumption that the network is the primary bottlene...
Remote Direct Memory Access (RDMA) is a technology to update a remote machine’s memory wit...
Modern critical computer applications often require continuous and correct operation despite the fai...
The increasingly complex tasks and growing size of data have necessit...