Store misses cause significant delays in shared-memory multiprocessors because of limited store buffering and ordering constraints required for proper synchronization. Today, programmers must choose from a spectrum of memory consistency models that reduce store stalls at the cost of increased programming complexity. Prior research suggests that the performance gap among consistency models can be closed through speculation—enforcing order only when dynamically necessary. Unfortunately, past designs either provide insufficient buffering, replace all stores with read-modify-write operations, and/or recover from ordering violations via impractical fine-grained rollback mechanisms. We propose two mechanisms that, together, enable store-wait–free...
Cache coherence protocols of current shared-memory multiprocessors are difficult to verify. Our prev...
For performance reasons, modern multiprocessors implement relaxed memory consistency models that adm...
This work presents BMW, a new design for speculative implementations of memory consistency models in...
The most commonly assumed memory consistency model for shared-memory multiprocessors is Sequential C...
Abstract. When verifying a concurrent program, it is usual to assume that memory is sequentially con...
We present a non-speculative solution for a coalescing store buffer in total store order (TSO) consi...
Correctly synchronizing multithreaded programs is challenging, and errors can lead to program failur...
Various memory consistency model implementations (e.g., x86, SPARC) willfully allow a core to see it...
Correctly synchronizing multithreaded programs is challenging, and errors can lead to program failur...
Abstract. Simple and unified non-operational specifications of the three memory consistency models T...
Classical model-checking tools verify concurrent programs under the tra-ditional Sequential Consiste...
In an out-of-order core, the load queue (LQ), the store queue (SQ), and the store buffer (SB) are re...
Recent research indicates that hardware can relax memory order speculatively to allow systems that i...
Shared memory has been widely adopted as the primary system level programming abstraction on modern ...
Although the sequential consistency (SC) model is the most intu-itive, processor designers often cho...
Cache coherence protocols of current shared-memory multiprocessors are difficult to verify. Our prev...
For performance reasons, modern multiprocessors implement relaxed memory consistency models that adm...
This work presents BMW, a new design for speculative implementations of memory consistency models in...
The most commonly assumed memory consistency model for shared-memory multiprocessors is Sequential C...
Abstract. When verifying a concurrent program, it is usual to assume that memory is sequentially con...
We present a non-speculative solution for a coalescing store buffer in total store order (TSO) consi...
Correctly synchronizing multithreaded programs is challenging, and errors can lead to program failur...
Various memory consistency model implementations (e.g., x86, SPARC) willfully allow a core to see it...
Correctly synchronizing multithreaded programs is challenging, and errors can lead to program failur...
Abstract. Simple and unified non-operational specifications of the three memory consistency models T...
Classical model-checking tools verify concurrent programs under the tra-ditional Sequential Consiste...
In an out-of-order core, the load queue (LQ), the store queue (SQ), and the store buffer (SB) are re...
Recent research indicates that hardware can relax memory order speculatively to allow systems that i...
Shared memory has been widely adopted as the primary system level programming abstraction on modern ...
Although the sequential consistency (SC) model is the most intu-itive, processor designers often cho...
Cache coherence protocols of current shared-memory multiprocessors are difficult to verify. Our prev...
For performance reasons, modern multiprocessors implement relaxed memory consistency models that adm...
This work presents BMW, a new design for speculative implementations of memory consistency models in...