This paper presents user-level dynamic page migration, a runtime technique that transparently enables parallel programs to tune their memory performance on distributed shared-memory multiprocessors, using feedback obtained from dynamic monitoring of memory activity. Our technique exploits the iterative nature of parallel programs and information available to the program at both compile time and runtime to improve the accuracy and timeliness of page migrations and to better amortize the overhead, compared to page migration engines implemented in the operating system. We present an adaptive page migration algorithm based on a competitive and a predictive criterion. The competitive criterion is used to correct poor page pl...
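As a rough illustration of what a competitive migration criterion can look like, the sketch below compares per-node access counts for a single page and proposes a move only when some remote node clearly out-accesses the current home node. This is a generic textbook-style rule, not the algorithm from the paper: the function name `competitive_target`, the `counts` layout, and the `threshold` value of 2.0 are all assumptions made for the example.

```python
from typing import Optional, Sequence


def competitive_target(counts: Sequence[int], home: int,
                       threshold: float = 2.0) -> Optional[int]:
    """Return the node a page should move to, or None to leave it in place.

    counts[i] is the number of accesses node i made to the page during the
    last monitoring interval; home is the node that currently holds the page.
    Both the counter layout and the threshold are hypothetical.
    """
    # Pick the remote node with the most accesses; the lowest index wins ties.
    best = max((n for n in range(len(counts)) if n != home),
               key=lambda n: counts[n], default=None)
    if best is None:
        return None  # single-node system: nowhere to migrate
    # Migrate only when remote traffic exceeds local traffic by the threshold.
    if counts[best] > threshold * counts[home]:
        return best
    return None
```

With `counts = [10, 50, 5]` and `home = 0`, the function proposes node 1, because 50 remote accesses exceed twice the 10 local ones. With `counts = [10, 15, 5]` it returns `None`, so the page stays put. A larger threshold makes migration more conservative, which matters when each move has a noticeable cost.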
Several cache-coherent shared-memory multiprocessors have been developed that are scalable and offer...
This paper makes two important contributions. First, the paper investigates the performance implica...
The cost of a cache miss depends heavily on the location of the main memory that backs the missing l...
This paper presents algorithms for improving the performance of parallel programs on multiprogrammed...
The performance of multiprogrammed shared-memory multiprocessors often suffers from scheduler interv...
This paper describes transparent mechanisms for emulating some of the data distribution facilities ...
The page migration problem occurs in managing a globally addressed shared memory in a multiprocessor...
This paper is concerned with the page migration (or file migration) problem (Black and Sleat...
Static cache partitioning can reduce inter-application cache interference and improve the composite ...
This paper investigates the performance implications of data placement in OpenMP programs running on...
Virtual memory offers a simple hardware abstraction to programmers, freeing them from the tedious pro...
NUMA abilities such as explicit migration of memory buffers enable flexible placement of data buffe...