Dependences among loads and stores whose addresses are unknown hinder the extraction of instruction level parallelism during the execution of a sequential program. Such ambiguous memory dependences can be overcome by memory dependence speculation which enables a load or store to be speculatively executed before the addresses of all preceding loads and stores are known. Furthermore, multiple speculative stores to a memory location create multiple speculative versions of the location. Program order among the speculative versions must be tracked to maintain sequential semantics. A previously proposed approach, the Address Resolution Buffer(ARB) uses a centralized buffer to support speculative versions. Our proposal, called the Speculative Vers...
Current trends in processor design are pointing to deeper and wider pipelines and superscalar archit...
With processor vendors pursuing multicore products, often at the expense of the complexity and aggre...
With the advent of multicore processors, extracting thread level parallelism from a sequential progr...
Dependences among loads and stores whose addresses are unknown hinder the extraction of instruction ...
This work presents BMW, a new design for speculative implementations of memory consistency models in...
This article describes cache designs for efficiently supporting speculative techniques like transact...
Thread-level speculation provides architectural support to aggressively run hard-to-analyze code in ...
This work presents BMW, a new design for speculative implementations of memory consistency models in...
this paper, we introduce a novel taxonomy of approaches to buffer and manage multiversion speculativ...
Recent proposals for multithreaded architectures employ speculative execution to allow threads with ...
Maximal utilization of cores in multicore architectures is key to realize the potential performance ...
Recent proposals for multithreaded architectures employ speculative execution to allow threads with ...
Data dependence speculation allows a compiler to relax the constraint of data-independence to issue ...
With the advent of chip-multiprocessors (CMPs), Thread-Level Speculation (TLS) remains a promising t...
Recent research indicates that hardware can relax memory order speculatively to allow systems that i...
Current trends in processor design are pointing to deeper and wider pipelines and superscalar archit...
With processor vendors pursuing multicore products, often at the expense of the complexity and aggre...
With the advent of multicore processors, extracting thread level parallelism from a sequential progr...
Dependences among loads and stores whose addresses are unknown hinder the extraction of instruction ...
This work presents BMW, a new design for speculative implementations of memory consistency models in...
This article describes cache designs for efficiently supporting speculative techniques like transact...
Thread-level speculation provides architectural support to aggressively run hard-to-analyze code in ...
This work presents BMW, a new design for speculative implementations of memory consistency models in...
this paper, we introduce a novel taxonomy of approaches to buffer and manage multiversion speculativ...
Recent proposals for multithreaded architectures employ speculative execution to allow threads with ...
Maximal utilization of cores in multicore architectures is key to realize the potential performance ...
Recent proposals for multithreaded architectures employ speculative execution to allow threads with ...
Data dependence speculation allows a compiler to relax the constraint of data-independence to issue ...
With the advent of chip-multiprocessors (CMPs), Thread-Level Speculation (TLS) remains a promising t...
Recent research indicates that hardware can relax memory order speculatively to allow systems that i...
Current trends in processor design are pointing to deeper and wider pipelines and superscalar archit...
With processor vendors pursuing multicore products, often at the expense of the complexity and aggre...
With the advent of multicore processors, extracting thread level parallelism from a sequential progr...