We re-visit the issue of hardware consistency models in the new context of massively-threaded throughput-oriented processors (MTTOPs). A prominent example of an MTTOP is a GPGPU, but other examples include Intel’s MIC architecture and some recent academic designs. MTTOPs differ from CPUs in many significant ways, including their ability to tolerate latency, their memory system organization, and the characteristics of the software they run. We compare implementations of various hardware consistency models for MTTOPs in terms of performance, energy-efficiency, hardware complexity, and programmability. Our results show that the choice of hardware consistency model has a surprisingly minimal impact on performance and thus the decision should be...
Graphics Processing Units (GPUs) have been shown to be effective at achieving large speedups over co...
Abstract. The problem of verifying multi-threaded execution against the memory consistency model of ...
Modern memory consistency models are complex, and it is difficult to reason about the relaxed behavi...
The memory consistency model of a shared-memory multiprocessor determines the extent to which memory...
Parallel systems that support the shared memory abstraction are becoming widely accepted in many are...
The memory consistency model supported by a multiprocessor architecture determines the amount of buf...
I have read the thesis of Tyler Sorensen in its final form and have found that (1) its format, citat...
The memory consistency model (or memory model) of a shared-memory multiprocessor system influences ...
This paper discusses memory consistency models and their influence on software in the context of par...
The shared memory systems should support parallelization at the computation (multi-core), communicat...
This paper studies realization and performance comparison of the sequential and weak consistency mod...
Today’s mobile, desktop, and server processors are heterogeneous, consisting not only of CPUs but al...
AbstractA number of highly-threaded, many-core architectures hide memory-access latency by low-overh...
Variation in performance and power across manufactured parts and their operating conditions is an ac...
Abstract—Contemporary microprocessors use relaxed mem-ory consistency models to allow for aggressive...
Graphics Processing Units (GPUs) have been shown to be effective at achieving large speedups over co...
Abstract. The problem of verifying multi-threaded execution against the memory consistency model of ...
Modern memory consistency models are complex, and it is difficult to reason about the relaxed behavi...
The memory consistency model of a shared-memory multiprocessor determines the extent to which memory...
Parallel systems that support the shared memory abstraction are becoming widely accepted in many are...
The memory consistency model supported by a multiprocessor architecture determines the amount of buf...
I have read the thesis of Tyler Sorensen in its final form and have found that (1) its format, citat...
The memory consistency model (or memory model) of a shared-memory multiprocessor system influences ...
This paper discusses memory consistency models and their influence on software in the context of par...
The shared memory systems should support parallelization at the computation (multi-core), communicat...
This paper studies realization and performance comparison of the sequential and weak consistency mod...
Today’s mobile, desktop, and server processors are heterogeneous, consisting not only of CPUs but al...
AbstractA number of highly-threaded, many-core architectures hide memory-access latency by low-overh...
Variation in performance and power across manufactured parts and their operating conditions is an ac...
Abstract—Contemporary microprocessors use relaxed mem-ory consistency models to allow for aggressive...
Graphics Processing Units (GPUs) have been shown to be effective at achieving large speedups over co...
Abstract. The problem of verifying multi-threaded execution against the memory consistency model of ...
Modern memory consistency models are complex, and it is difficult to reason about the relaxed behavi...