This paper presents algorithms for improving the performance of parallel programs on multiprogrammed shared-memory NUMA multiprocessors, via the use of user-level dynamic page migration. The idea that drives the algorithms is that a page migration engine can perform accurate and timely page migrations in a multiprogrammed system if it can correlate page reference information with scheduling information obtained from the operating system. The necessary page migrations can be performed as a response to scheduling events that break the implicit association between threads and their memory affinity sets. We present two algorithms that use feedback from the kernel scheduler to aggressively migrate pages upon thread migrations. The first algorithm...
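The general idea of reacting to a scheduling event with page migrations can be illustrated with a small toy model. The sketch below is not either of the paper's two algorithms, whose descriptions are cut off above. It assumes a hypothetical `PageMigrationEngine` class that keeps per-page reference counts per thread, uses first-touch placement, and applies a simple "dominant referencer" rule when the scheduler reports that a thread has moved. All names and the migration criterion are illustrative assumptions.

```python
from collections import defaultdict


class PageMigrationEngine:
    """Toy model (hypothetical): correlates page references with thread placement."""

    def __init__(self):
        self.thread_node = {}   # thread id -> node the thread currently runs on
        self.page_home = {}     # page id -> node that currently holds the page
        # page id -> {thread id -> number of references observed}
        self.refs = defaultdict(lambda: defaultdict(int))

    def place_thread(self, thread, node):
        """Record the initial node of a thread (scheduling information)."""
        self.thread_node[thread] = node

    def reference(self, thread, page, count=1):
        """Record page references; a page is homed where it is first touched."""
        self.page_home.setdefault(page, self.thread_node[thread])
        self.refs[page][thread] += count

    def on_thread_migration(self, thread, new_node):
        """Scheduler feedback: `thread` now runs on `new_node`.

        Assumed rule: migrate every page whose dominant referencer is the
        migrated thread, so its memory affinity set follows it.
        """
        self.thread_node[thread] = new_node
        moved = []
        for page, counts in self.refs.items():
            dominant = max(counts, key=counts.get)
            if dominant == thread and self.page_home[page] != new_node:
                self.page_home[page] = new_node
                moved.append(page)
        return sorted(moved)


engine = PageMigrationEngine()
engine.place_thread("t0", 0)
engine.place_thread("t1", 1)
engine.reference("t0", "A", 10)   # page A is mostly used by t0
engine.reference("t1", "A", 2)
engine.reference("t1", "B", 5)    # page B is mostly used by t1
engine.reference("t0", "B", 1)
moved_pages = engine.on_thread_migration("t0", 2)
```

In this toy run only page `A` follows thread `t0` to node 2, while page `B` stays with its dominant user `t1`. A real engine would read hardware or OS reference counters and weigh migration cost against expected benefit, which this sketch leaves out.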
Thread migration is established as a mechanism for achieving dynamic load sharing and data locality....
This paper presents user-level dynamic page migration, a runtime technique which transparently enabl...
The performance of multiprogrammed shared-memory multiprocessors often suffers from scheduler interv...
The page migration problem occurs in managing a globally addressed shared memory in a multiprocessor...
Several cache-coherent shared-memory multiprocessors have been developed that are scalable and offer...
This paper describes transparent mechanisms for emulating some of the data distribution facilities ...
This paper is concerned with the page migration (or file migration) problem (Black and Sleat...
In this paper we describe the way thread migration can be carried out in Distributed Shared Memory (...
Virtual memory offers a simple hardware abstraction to programmers, freeing them from the tedious pro...
This paper compares data distribution methodologies for scaling the performance of OpenMP on NUMA ar...
Proceedings of the 4th IEEE International Symposium on Network Computing and Applications, NCA 2005,...
In this work we study the effect of data locality on the performance of Gaussian 03 code running on ...