In this paper we revisit the design of concurrent data structures -- specifically queues -- and examine their performance portability with regard to the move from conventional CPUs to graphics processors. We have looked at both lock-based and lock-free algorithms and have, for comparison, implemented and optimized the same algorithms on both graphics processors and multi-core CPUs. Particular interest has been paid to study the difference between the old Tesla and the new Fermi and Kepler architectures in this context. We provide a comprehensive evaluation and analysis of our implementations on all examined platforms. Our results indicate that the queues are in general performance portable, but that platform specific optimizations are poss...
The efficiency of concurrent data structures is crucial to the performance of multi-threaded program...
This paper investigates the synchronization power of coalesced memory accesses, a family of memory a...
Link to published version: http://ieeexplore.ieee.org/iel3/4440/12600/00580906.pdf?tp=&arnumber=5809...
In this paper we revisit the design of concurrent data structures -- specifically queues -- and exam...
As core counts increase and as heterogeneity becomes more common in parallel computing, we face the ...
Synchronization of concurrent threads is the central problem in order to design efficient concurrent...
The convergence of highly parallel many-core graphics processors with conventional multi-core proces...
Concurrent data structures provide the means to multi-threaded applications to share data. Typical d...
Abstract. In this work, we study the scalability, performance, design and implementation of basic da...
To get maximum performance on the many-core graphics processors it is important to have an even bala...
The concurrent priority queue is one of the shared memory data structures that can be dynamically ma...
In this paper, we revisit the design of synchronization primitives---specifically barriers, mutexes,...
This study analyzes the efficiency of parallel computational applications with the adoption of recen...
Data analyze has become very important with growth of information today. There is a need of real-tim...
\ua9 2017 by John Wiley & Sons, Inc. All rights reserved. Concurrent data structures are the data sh...
The efficiency of concurrent data structures is crucial to the performance of multi-threaded program...
This paper investigates the synchronization power of coalesced memory accesses, a family of memory a...
Link to published version: http://ieeexplore.ieee.org/iel3/4440/12600/00580906.pdf?tp=&arnumber=5809...
In this paper we revisit the design of concurrent data structures -- specifically queues -- and exam...
As core counts increase and as heterogeneity becomes more common in parallel computing, we face the ...
Synchronization of concurrent threads is the central problem in order to design efficient concurrent...
The convergence of highly parallel many-core graphics processors with conventional multi-core proces...
Concurrent data structures provide the means to multi-threaded applications to share data. Typical d...
Abstract. In this work, we study the scalability, performance, design and implementation of basic da...
To get maximum performance on the many-core graphics processors it is important to have an even bala...
The concurrent priority queue is one of the shared memory data structures that can be dynamically ma...
In this paper, we revisit the design of synchronization primitives---specifically barriers, mutexes,...
This study analyzes the efficiency of parallel computational applications with the adoption of recen...
Data analyze has become very important with growth of information today. There is a need of real-tim...
\ua9 2017 by John Wiley & Sons, Inc. All rights reserved. Concurrent data structures are the data sh...
The efficiency of concurrent data structures is crucial to the performance of multi-threaded program...
This paper investigates the synchronization power of coalesced memory accesses, a family of memory a...
Link to published version: http://ieeexplore.ieee.org/iel3/4440/12600/00580906.pdf?tp=&arnumber=5809...