Many future shared-memory multiprocessor servers will both target commercial workloads and use highly-integrated glueless designs. Implementing low-latency cache coherence in these systems is difficult, because traditional approaches either add indirection for common cache-to-cache misses (directory protocols) or require a totally-ordered interconnect (traditional snooping protocols). Unfortunately, totally-ordered interconnects are difficult to implement in glueless designs. An ideal coherence protocol would avoid indirections and interconnect ordering; however, such an approach introduces numerous protocol races that are difficult to resolve. We propose a new coherence framework to enable such protocols by separating performance from co...
We propose UNITD, a unified hardware coherence framework that integrates translation coherence into ...
In this paper we describe our experience with Teapot [7], a domain-specific language for writing cac...
This invited paper argues that to facilitate formal verification, multiprocessor systems should (1) ...
Many future shared-memory multiprocessor servers will both target commercial workloads and use highl...
The coherence protocol is a first-order design concern in multicore designs. Directory protocols are...
[EN] Token Coherence is a cache coherence protocol that simultaneously captures the best attributes ...
Commercial workload and technology trends are pushing existing shared-memory multiprocessor coherenc...
Abstract—As Internet and information technology have continued developing, the necessity for fast pa...
Token Coherence is a cache coherence protocol able to simultaneously capture the best attributes of ...
Cache coherence is one of the main challenges to tackle when designing a shared-memory multiprocesso...
Traditional coherence protocols present a set of difficult trade-offs: the reliance of snoopy protoc...
Cache coherence protocols play an important role in the performance of distributed and centralized s...
Improvements in semiconductor technology now enable Chip Multiprocessors (CMPs). As many future comp...
It is widely accepted that transient failures will appear more frequently in chips designed in the n...
With transistor miniaturization leading to an abundance of on-chip resources and uniprocessor design...
We propose UNITD, a unified hardware coherence framework that integrates translation coherence into ...
In this paper we describe our experience with Teapot [7], a domain-specific language for writing cac...
This invited paper argues that to facilitate formal verification, multiprocessor systems should (1) ...
Many future shared-memory multiprocessor servers will both target commercial workloads and use highl...
The coherence protocol is a first-order design concern in multicore designs. Directory protocols are...
[EN] Token Coherence is a cache coherence protocol that simultaneously captures the best attributes ...
Commercial workload and technology trends are pushing existing shared-memory multiprocessor coherenc...
Abstract—As Internet and information technology have continued developing, the necessity for fast pa...
Token Coherence is a cache coherence protocol able to simultaneously capture the best attributes of ...
Cache coherence is one of the main challenges to tackle when designing a shared-memory multiprocesso...
Traditional coherence protocols present a set of difficult trade-offs: the reliance of snoopy protoc...
Cache coherence protocols play an important role in the performance of distributed and centralized s...
Improvements in semiconductor technology now enable Chip Multiprocessors (CMPs). As many future comp...
It is widely accepted that transient failures will appear more frequently in chips designed in the n...
With transistor miniaturization leading to an abundance of on-chip resources and uniprocessor design...
We propose UNITD, a unified hardware coherence framework that integrates translation coherence into ...
In this paper we describe our experience with Teapot [7], a domain-specific language for writing cac...
This invited paper argues that to facilitate formal verification, multiprocessor systems should (1) ...