Large-scale shared-memory multiprocessors typically have long latencies for remote data accesses. A key issue for execution performance of many common applications is the synchronization cost. The communication scalability of synchronization has been improved by the introduction of queue-based spin-locks instead of Test&(Test&Set). For architectures with long access latencies for global data, attention should also be paid to the number of global accesses that are involved in synchronization. We present a method to characterize the performance of proposed queue lock algorithms, and apply it to previously published algorithms. We also present two new queue locks, the LH lock and the M lock. We compare the locks in terms of performance...
Predictable interprocessor synchronization and fast interrupt response are important for real-time s...
We present a fast and scalable lock algorithm for shared-memory multiprocessors addressing the resou...
We present a scalable lock algorithm and an adaptive scheme for shared-memory multiprocessors addres...
Large-scale shared-memory multiprocessors typically have long latencies for remote data accesses. A...
Synchronization primitives for large scale multiprocessors need to provide low latency and low conte...
Busy-wait techniques are heavily used for mutual exclusion and barrier synchronization in shared-mem...
Link to published version: http://portal.acm.org/ft_gateway.cfm?id=379566&type=pdf&coll=portal&dl=AC...
Busy-wait techniques are heavily used for mutual exclusion and barrier synchronization in shared-mem...
Queue-based spin locks allow programs with busy-wait syn-chronization to scale to very large multipr...
Scalable busy-wait synchronization algorithms are essential for achieving good parallel program perf...
Efficient synchronization is important for achieving good performance in parallel programs, especial...
Scalable busy-wait synchronization algorithms are essential for achieving good parallel program perf...
Journal ArticleShared memory programs guarantee the correctness of concurrent accesses to shared dat...
Efficient synchronization is important for achieving good performance in parallel programs, especial...
Abstract. Synchronization in parallel programs is a major performance bottleneck. Shared data is pro...
Predictable interprocessor synchronization and fast interrupt response are important for real-time s...
We present a fast and scalable lock algorithm for shared-memory multiprocessors addressing the resou...
We present a scalable lock algorithm and an adaptive scheme for shared-memory multiprocessors addres...
Large-scale shared-memory multiprocessors typically have long latencies for remote data accesses. A...
Synchronization primitives for large scale multiprocessors need to provide low latency and low conte...
Busy-wait techniques are heavily used for mutual exclusion and barrier synchronization in shared-mem...
Link to published version: http://portal.acm.org/ft_gateway.cfm?id=379566&type=pdf&coll=portal&dl=AC...
Busy-wait techniques are heavily used for mutual exclusion and barrier synchronization in shared-mem...
Queue-based spin locks allow programs with busy-wait syn-chronization to scale to very large multipr...
Scalable busy-wait synchronization algorithms are essential for achieving good parallel program perf...
Efficient synchronization is important for achieving good performance in parallel programs, especial...
Scalable busy-wait synchronization algorithms are essential for achieving good parallel program perf...
Journal ArticleShared memory programs guarantee the correctness of concurrent accesses to shared dat...
Efficient synchronization is important for achieving good performance in parallel programs, especial...
Abstract. Synchronization in parallel programs is a major performance bottleneck. Shared data is pro...
Predictable interprocessor synchronization and fast interrupt response are important for real-time s...
We present a fast and scalable lock algorithm for shared-memory multiprocessors addressing the resou...
We present a scalable lock algorithm and an adaptive scheme for shared-memory multiprocessors addres...