In response to the "datacenter tax" and "killer microseconds" problems in datacenter applications, diverse solutions, including SmartNIC-based ones, have been proposed. Nonetheless, they often suffer from the high overhead of communication over network and/or PCIe links. To address the limitations of current solutions, this paper proposes ORCA, a holistic network and architecture co-design solution that leverages current RDMA and emerging cache-coherent off-chip interconnect technologies. Specifically, ORCA consists of four hardware and software components: (1) a unified abstraction of inter- and intra-machine communications managed by one-sided RDMA write and cache-coherent memory write; (2) efficient notification of requests to accelerators ...
With the emergence of data-intensive applications, recent years have seen a fast-growing volume of I...
Orca is a language for implementing parallel applications on loosely coupled distributed systems. U...
Aiming to solve the low utilization and high operational cost in current data centers and the divers...
Orca is a portable, object-based distributed shared memory (DSM) system. This article studies and ev...
Modern datacenters are the foundation of large scale Internet services, such as search engines, clou...
Modern datacenters utilize traditional Ethernet networks to connect hundreds or thousands of mac...
Remote Direct Memory Access (RDMA) fabrics such as Infiniband and Converged Ethernet report latencie...
The paper introduces Network-on-Chip (NoC) design methodology and low cost mechanisms for supporting...
Exploiting at best every bit of memory on chip is a must for finding the best ...
Global interconnect becomes the delay bottleneck in microprocessor designs, and latency for large on...
In this paper, we present a hierarchical Data Cache Architecture called DCA to effectively slash lo...
Efficient resource utilization requires that emerging datacenter interconnects support both high per...
CPU and GPU platforms may not be the best options for many emerging compute patterns, which led to a...
Datacenters are vitally important in the information era in which we currently live. Datacenters are...
We introduce the concept of deadlock-free migration-based coherent shared memory to the NUCA family ...