Parallel programming models should attempt to satisfy two conflicting goals. On one hand, they should hide architectural details so that algorithm designers can write simple, portable programs. On the other hand, models must expose architectural details so that designers can evaluate and optimize the performance of their algorithms. In this paper, we experimentally examine the trade-offs made by a simple shared-memory model, QSM, to address this dilemma. The results indicate that analysis under the QSM model yields quite accurate results for reasonable input sizes and that algorithms developed under QSM achieve performance close to that obtainable through more complex models, such as BSP and LogP.
Parallel programming models are quite a challenging and emerging topic in the parallel computing era. ...
In this paper we present a new approach to benchmark the performance of shared memory systems. This ...
Parallel graph reduction is a conceptually simple model for the concurrent evaluation of lazy functi...
We present work-preserving emulations with small slowdown between LogP and two other parallel models...
Shared memory models have been criticized for years for failing to model essential realities of para...
There has been a great deal of interest recently in the development of general-purpose bridging mode...
The goal of this work was to examine existing shared memory parallel programming models, figure out ...
Data locality is a well-recognized requirement for the development of any parallel application, but ...
We evaluate the claim that a PRAM-like parallel programming model (XMTC) requires less effort than a...
This paper develops and validates an analytical model for evaluating various types of architectural ...
Most current multiprocessor architectures and shared memory parallel programming languages are not ...
The majority of current HPC applications are composed of complex and irregular data structures that ...