Symmetric multiprocessors (SMPs) connected with low-latency networks provide attractive building blocks for software distributed shared memory systems. Two distinct approaches have been used: the fine-grain approach that instruments application loads and stores to support a small coherence granularity, and the coarse-grain approach based on virtual memory hardware that provides coherence at a page granularity. Fine-grain systems offer a simple migration path for applications developed on hardware multiprocessors by supporting coherence protocols similar to those implemented in hardware. On the other hand, coarse-grain systems can potentially provide higher performance through more optimized protocols and larger transfer granularities, whil...
This thesis describes and evaluates the effectiveness of four hardware mechanisms for software share...
Parallel workstations, each comprising 10-100 processors, promise cost-effective general-purpose mul...
We argue that OS-provided data coherence on non-cache-coherent NUMA multiprocessors (machines with a...
This paper reports our experience implementing the Blizzard fine-grain distributed shared memory sys...
Shared-memory architectures represent a class of parallel computer systems commonly used in the comm...
Plentiful research has addressed low-complexity software-based shared-memory systems since the idea ...
Clusters of workstations have long provided a cost-effective, large-scale parallel computing platfor...
A coarse-grain parallel program typically has one thread (task) per processor, whereas a fine-grain ...
Software fine-grain distributed shared memory (FGDSM) provides a simplified shared-memory programmin...
Parallel systems supporting a shared memory programming interface have been implemented both in soft...
A software distributed shared memory (DSM) system allows shared memory parallel programs to execute ...
During the past few years, two main approaches have been taken to improve the performance of softwar...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/16...