Historically, processor accesses to memory-mapped device registers have been marked uncachable to insure their visibility to the device. The ubiquity of snooping cache coherence, however, makes it possible for processors and devices to interact with cachable, coherent memory operations. Using coherence can improve performance by facilitating burst transfers of whole cache blocks and reducing control overheads (e.g., for polling). This paper begins an exploration of network interfaces (NIs) that use coherence---coherent network interfaces (CNIs)---to improve communication performance. We restrict this study to NI/ CNIs that reside on coherent memory or I/O buses, to NI/CNIs that are much simpler than processors, and to the performance of fin...
Abstract—Chip Multiprocessor Systems (CMPs) rely on a cache coherency protocol to maintain memory ac...
[EN] Future chip multiprocessors will include hundreds of cores organised in a tile-based design pat...
[[abstract]]A cache coherence protocol for a multiprocessor system. Each processor in the system has...
Using coherence can improve performance by facilitating burst transfers of whole cache blocks and re...
Abstract—As Internet and information technology have continued developing, the necessity for fast pa...
Recent developments in shared-memory multiprocessor systems advocate using off-the-shelf hardware to...
With transistor miniaturization leading to an abundance of on-chip resources and uniprocessor design...
Thesis (Ph. D.)--University of Washington, 1997Two recent trends are affecting the design of medium-...
We argue that OS-provided data coherence on non-cache-coherent NUMA multiprocessors (machines with a...
Abstract—Scalable distributed shared-memory architectures rely on coherence controllers on each proc...
Design complexity and limited power budget are causing the number of cores on the same chip to grow ...
Software distributed shared memory (DSM) platforms on networks of workstations tolerate large networ...
Shared memory systems generally support consumerinitiated communication; when a process needs data,...
Cache coherence protocols for shared-memory multiprocessors use invalidations or updates to maintain...
In a multiprocessor system on chip private caches introduce the cache coherence problem; because pro...
Abstract—Chip Multiprocessor Systems (CMPs) rely on a cache coherency protocol to maintain memory ac...
[EN] Future chip multiprocessors will include hundreds of cores organised in a tile-based design pat...
[[abstract]]A cache coherence protocol for a multiprocessor system. Each processor in the system has...
Using coherence can improve performance by facilitating burst transfers of whole cache blocks and re...
Abstract—As Internet and information technology have continued developing, the necessity for fast pa...
Recent developments in shared-memory multiprocessor systems advocate using off-the-shelf hardware to...
With transistor miniaturization leading to an abundance of on-chip resources and uniprocessor design...
Thesis (Ph. D.)--University of Washington, 1997Two recent trends are affecting the design of medium-...
We argue that OS-provided data coherence on non-cache-coherent NUMA multiprocessors (machines with a...
Abstract—Scalable distributed shared-memory architectures rely on coherence controllers on each proc...
Design complexity and limited power budget are causing the number of cores on the same chip to grow ...
Software distributed shared memory (DSM) platforms on networks of workstations tolerate large networ...
Shared memory systems generally support consumerinitiated communication; when a process needs data,...
Cache coherence protocols for shared-memory multiprocessors use invalidations or updates to maintain...
In a multiprocessor system on chip private caches introduce the cache coherence problem; because pro...
Abstract—Chip Multiprocessor Systems (CMPs) rely on a cache coherency protocol to maintain memory ac...
[EN] Future chip multiprocessors will include hundreds of cores organised in a tile-based design pat...
[[abstract]]A cache coherence protocol for a multiprocessor system. Each processor in the system has...