The advances in semiconductor technology have set the shared memory server trend towards processors with multiple cores per die and multiple threads per core. This paper presents simple hardware primitives en-abling flexible and low complexity multi-chip designs supporting an efficient inter-node coherence protocol run in software. The design is based on two node per-mission bits per cache line and a new way to decouple the intra-chip coherence protocol from the inter-node coherence protocol. The protocol implementation en-ables the system to cache remote data in the local mem-ory system with no additional hardware support. Our evaluation is based on detailed full system sim-ulation of both commercial and HPC workloads. We compare a low-com...
Thesis (Ph. D.)--University of Washington, 1997Two recent trends are affecting the design of medium-...
Abstract—Scalable distributed shared-memory architectures rely on coherence controllers on each proc...
Scalable distributed shared-memory architectures rely on coher-ence controllers on each processing n...
Shared memory is widely regarded as a more intuitive model than message passing for the development ...
Journal ArticleImprovements in semiconductor technology have made it possible to include multiple p...
Many-core architectures provide an efficient way of harnessing the increasing numbers of transistors...
Multi-core architectures have emerged as the best alternative to take advantage of the increas-ing n...
The interconnect mechanisms (shared bus or crossbar) used in current chip-multiprocessors (CMPs) are...
Plentiful research has addressed low-complexity software-based shared-memory systems since the idea ...
Abstract The interconnect mechanisms (shared bus or crossbar) used in current chip-multiprocessors (...
Many-core architectures provide an efficient way of harnessing the growing numbers of transistors av...
Shared-memory architectures represent a class of parallel computer systems commonly used in the comm...
It is widely accepted that transient failures will appear more frequently in chips designed in the n...
As the number of cores increases on chip multiprocessors, coherence is fast becoming a central issue...
On the road to computer systems able to support the requirements of exascale applications, Chip Mult...
Thesis (Ph. D.)--University of Washington, 1997Two recent trends are affecting the design of medium-...
Abstract—Scalable distributed shared-memory architectures rely on coherence controllers on each proc...
Scalable distributed shared-memory architectures rely on coher-ence controllers on each processing n...
Shared memory is widely regarded as a more intuitive model than message passing for the development ...
Journal ArticleImprovements in semiconductor technology have made it possible to include multiple p...
Many-core architectures provide an efficient way of harnessing the increasing numbers of transistors...
Multi-core architectures have emerged as the best alternative to take advantage of the increas-ing n...
The interconnect mechanisms (shared bus or crossbar) used in current chip-multiprocessors (CMPs) are...
Plentiful research has addressed low-complexity software-based shared-memory systems since the idea ...
Abstract The interconnect mechanisms (shared bus or crossbar) used in current chip-multiprocessors (...
Many-core architectures provide an efficient way of harnessing the growing numbers of transistors av...
Shared-memory architectures represent a class of parallel computer systems commonly used in the comm...
It is widely accepted that transient failures will appear more frequently in chips designed in the n...
As the number of cores increases on chip multiprocessors, coherence is fast becoming a central issue...
On the road to computer systems able to support the requirements of exascale applications, Chip Mult...
Thesis (Ph. D.)--University of Washington, 1997Two recent trends are affecting the design of medium-...
Abstract—Scalable distributed shared-memory architectures rely on coherence controllers on each proc...
Scalable distributed shared-memory architectures rely on coher-ence controllers on each processing n...