This paper explores the interaction of value prediction with thread-level parallelism techniques, including multithreading and multiprocessing, where correctness is defined by a memory consistency model. Value prediction subtly interacts with the memory consistency model by allowing data-dependent instructions to be reordered. We find that predicting a value and later verifying that the eventually calculated value matches the prediction is not always sufficient. We present an example of a multithreaded pointer manipulation that can generate a surprising and erroneous result when value prediction is implemented without considering memory consistency correctness. We show that this problem can occur with real software, and we discu...
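The kind of hazard this abstract describes can be illustrated with a small, deterministic replay of one interleaving. This is only a sketch under stated assumptions, not the paper's own example: the node names, the values, and the `run` function are hypothetical, and the "processor" is a hand-ordered sequence of dictionary reads and writes rather than a hardware model. A writer initialises a node and then publishes it through a shared head pointer; a reader loads the head pointer and then the data it points to.

```python
# Deterministic replay of one interleaving; a toy model, not real hardware.
# All names and values below are hypothetical, chosen for illustration.

HEAD, NODE_A, NODE_B = "head", "A", "B"


def run(use_value_prediction: bool) -> int:
    """Return the data value the reader commits after following `head`."""
    # Shared memory: head points to node A; node B still holds stale data.
    memory = {HEAD: NODE_A, NODE_A: 1, NODE_B: 0}

    predicted_ptr = None
    speculative_data = None
    if use_value_prediction:
        # Reader, speculative part: predict that load(head) will return B
        # and issue the dependent load early, before the writer has run.
        predicted_ptr = NODE_B
        speculative_data = memory[predicted_ptr]  # observes the stale 0

    # Writer, in program order: initialise node B, then publish it.
    memory[NODE_B] = 42
    memory[HEAD] = NODE_B

    # Reader: the load of head finally executes and returns B.
    actual_ptr = memory[HEAD]

    if use_value_prediction:
        # Naive verification compares values only. The prediction matches,
        # so the stale dependent load is allowed to commit.
        assert actual_ptr == predicted_ptr
        return speculative_data

    # Without prediction, the dependent load runs after the pointer load.
    return memory[actual_ptr]


print(run(use_value_prediction=False))  # 42
print(run(use_value_prediction=True))   # 0
```

In this replay the reader observes the new head pointer but the old node contents, an outcome the writer's program order would forbid under sequential consistency, even though the predicted value was "verified" as correct.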
This work presents BMW, a new design for speculative implementations of memory consistency models in...
In this paper, we examined the behavior of three of the best performing branch prediction strategies...
Developing correct and performant concurrent systems is a major challenge. When programming an appli...
Value prediction attempts to eliminate true-data dependencies by dynamically predicting the outcome ...
Despite recent advances in high performance microprocessor architecture and compilation technologies...
Parallel systems that support the shared memory abstraction are becoming widely accepted in many are...
The ever-increasing computational power of contemporary microprocessors reduces the execution time s...
Speculative thread-level parallelism has been recently proposed as a source of parallelism to improv...
Published at "High Performance Computer Architecture (HPCA) 2014". Link: http:...
Although currently available general purpose microprocessors feature more than 10 cores, many progra...
The memory consistency model of a shared-memory multiprocessor determines the extent to which memory...
Dedicating more silicon area to single thread performance will necessarily be...
Increasing instruction-level parallelism is regaining attractiveness within th...