As the level of parallelism in manycore processors keeps increasing, providing efficient mechanisms for thread synchronization in concurrent programs is becoming a major concern. On cache-coherent shared-memory processors, synchronization efficiency is ultimately limited by the performance of the underlying cache coherence protocol. This paper studies how hardware support for message passing can improve synchronization performance. Considering the ubiquitous problem of mutual exclusion, we adapt two state-of-the-art solutions used on shared-memory processors, namely the server approach and the combining approach, to leverage the potential of hardware message passing. We propose HYBCOMB, a novel combining algorithm that uses both message pas...
The quest to improve performance forces designers to explore finer-grained multiprocessor machines. ...
Current and emerging high-performance parallel computer architectures generally implement one of two...
his paper addresses the problem of universal synchronization primitives that can support scalable th...
As the level of parallelism in manycore processors keeps increasing, providing efficient mechanisms ...
On the road to computer systems able to support the requirements of exascale applications, Chip Mult...
Multicore design is a major issue in modern computer architectures. Programmers are urged to design ...
This paper proposes a set of efficient primitives for process synchronization in multiprocessors. T...
AbstreetThis paper proposes a set of efficient primitives for process synchronization in muitiproces...
Shared memory multiprocessors are considered among the easiest parallel computers to program. Howeve...
Journal ArticleShared memory programs guarantee the correctness of concurrent accesses to shared dat...
Existing multiprocessor synchronization mechanisms are relatively heavyweight, due in part to the le...
International audienceNon-blocking collectives have been proposed so as to allow communications to b...
textThis dissertation explores techniques for reducing the costs of inter-processor communication i...
Single chip multicore processors are now prevalent and processors with hundreds of cores are being p...
Single-chip multiprocessors and multiple-thread architectures are becoming an affordable solution fo...
The quest to improve performance forces designers to explore finer-grained multiprocessor machines. ...
Current and emerging high-performance parallel computer architectures generally implement one of two...
his paper addresses the problem of universal synchronization primitives that can support scalable th...
As the level of parallelism in manycore processors keeps increasing, providing efficient mechanisms ...
On the road to computer systems able to support the requirements of exascale applications, Chip Mult...
Multicore design is a major issue in modern computer architectures. Programmers are urged to design ...
This paper proposes a set of efficient primitives for process synchronization in multiprocessors. T...
AbstreetThis paper proposes a set of efficient primitives for process synchronization in muitiproces...
Shared memory multiprocessors are considered among the easiest parallel computers to program. Howeve...
Journal ArticleShared memory programs guarantee the correctness of concurrent accesses to shared dat...
Existing multiprocessor synchronization mechanisms are relatively heavyweight, due in part to the le...
International audienceNon-blocking collectives have been proposed so as to allow communications to b...
textThis dissertation explores techniques for reducing the costs of inter-processor communication i...
Single chip multicore processors are now prevalent and processors with hundreds of cores are being p...
Single-chip multiprocessors and multiple-thread architectures are becoming an affordable solution fo...
The quest to improve performance forces designers to explore finer-grained multiprocessor machines. ...
Current and emerging high-performance parallel computer architectures generally implement one of two...
his paper addresses the problem of universal synchronization primitives that can support scalable th...