In this work, a model of computation for shared memory parallelism is presented. To address fundamental constraints of modern memory systems, the presented model constrains how parallelism interacts with memory access patterns and in doing so provides a method for design and analysis of algorithms that estimates reliable execution time based on a few architectural parameters. This model is presented as an alternative to modern thread based models that focus on computational concurrency but rely on reactive hardware policies to hide and amortize memory latency. Since modern processors use reactive mechanisms and heuristics to deduce the data access requirement of computations, the memory access costs of these threaded programs may be difficu...
Multi-core processors have become the dominant processor architecture with 2, 4, and 8 cores on a ch...
The last two decade has witnessed two opposing hardware trends where the DRAM capacity and the acces...
The `free' speed-up stemming from ever increasing processor speed is over. Performance increase in ...
We address the gap between structured parallel programming and parallel architectures by formalizing...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Multicore and many-core architectures have penetrated the vast majority of computing systems, from h...
Shared memory models have been criticized for years for failing to model essential realities of para...
There has been a great deal of interest recently in the development of general-purpose bridging mode...
AbstractIn this paper we show how parallel algorithms can be turned into efficient streaming algorit...
The field of streaming algorithms has enjoyed a deal of focus from the theoretical computer science ...
The capability of the Random Access Machine (RAM) to execute any instruction in constant time is not...
Expressing concurrency in applications has always been a difficult and error-prone endeavor, yet eff...
We introduce K-model, a computational model to evaluate the algorithms designed for graphic processo...
With emerging many-core architectures, using on-chip shared memories is an interesting approach beca...
We study the problem of minimizing total completion time on parallel machines subject to varying pro...
Multi-core processors have become the dominant processor architecture with 2, 4, and 8 cores on a ch...
The last two decade has witnessed two opposing hardware trends where the DRAM capacity and the acces...
The `free' speed-up stemming from ever increasing processor speed is over. Performance increase in ...
We address the gap between structured parallel programming and parallel architectures by formalizing...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Multicore and many-core architectures have penetrated the vast majority of computing systems, from h...
Shared memory models have been criticized for years for failing to model essential realities of para...
There has been a great deal of interest recently in the development of general-purpose bridging mode...
AbstractIn this paper we show how parallel algorithms can be turned into efficient streaming algorit...
The field of streaming algorithms has enjoyed a deal of focus from the theoretical computer science ...
The capability of the Random Access Machine (RAM) to execute any instruction in constant time is not...
Expressing concurrency in applications has always been a difficult and error-prone endeavor, yet eff...
We introduce K-model, a computational model to evaluate the algorithms designed for graphic processo...
With emerging many-core architectures, using on-chip shared memories is an interesting approach beca...
We study the problem of minimizing total completion time on parallel machines subject to varying pro...
Multi-core processors have become the dominant processor architecture with 2, 4, and 8 cores on a ch...
The last two decade has witnessed two opposing hardware trends where the DRAM capacity and the acces...
The `free' speed-up stemming from ever increasing processor speed is over. Performance increase in ...