Proceedings of the 4th IEEE International Symposium on Network Computing and Applications, NCA 2005, pp. 231-234.

This paper presents techniques that improve the performance of parallel programs on distributed shared memory NUMA multiprocessors by implementing dynamic memory block and page migration. Our techniques address the latencies caused by network contention and attempt to enhance data locality by migrating pages to reduce remote references. We analyze the behavior of eight multiprocessor applications that exhibit a wide range of network traffic patterns. Results show that several applications that encounter hot spots and network congestion reduce their run time by more than a factor of ten.
This paper presents user-level dynamic page migration, a runtime technique which transparently enabl...
We introduce the concept of deadlock-free migration-based coherent shared memory to the NUCA family ...
Multiprocessors with shared memory are considered more general and easier to program than message-pa...
Due to advances in fiber optics and VLSI technology, interconnection networks that allow simultaneou...
This paper presents algorithms for improving the performance of parallel programs on multiprogrammed...
A data migration mechanism is proposed that allows an explicit and controlled mapping of data to mem...
This paper introduces the doubly-linked list (DLL) protocol for distributed shared memory (DSM) mult...
In this paper, we compare and contrast two techniques to improve capacity/conflict miss traffic in C...
A data migration mechanism is proposed that allows an explicit and controlled mapping of data to mem...
The page migration problem occurs in managing a globally addressed shared memory in a multiprocessor...
The performance of multiprogrammed shared-memory multiprocessors often suffers from scheduler interv...
This paper presents the design and preliminary implementation of MpPVM, a software system that suppo...
It is well known that contention is one of the factors that limit the performance of high performanc...
The need to achieve higher performance through greater degrees of parallelism necessitates distribut...
We present an extension of a classical data management subproblem, the page migration. The p...