ABSTRACT There is a growing need to perform large computations on small systems, because access to large systems is not widely available and cannot keep up with the scaling of data. BDMPI was recently introduced as a way of achieving this for applications written in MPI. BDMPI allows the efficient execution of standard MPI programs on systems whose aggregate amount of memory is smaller than that required by the computations, and it significantly outperforms other approaches. In this paper we present a virtual memory subsystem that we implemented as part of the BDMPI runtime. Our new virtual memory subsystem, which we call SBMA, bypasses the operating system virtual memory manager to take advantage of BDMPI's node-level cooperative multitasking...
Simulation is a popular approach for predicting the performance of MPI applications for platforms th...
Abstract. Processor virtualization is a powerful technique that enables the run-time system to carry...
The work in this paper focuses on providing malleability to MPI applications by using a novel perfor...
The processing of massive amounts of data on clusters with a finite amount of memory has become an imp...
Parallel algorithms for the Bulk Synchronous Parallel (BSP) and closely related Coarse Grained Multic...
Scalability to a large number of processes is one of the weaknesses of current MPI implementations. St...
Performance-hungry data center applications demand increasingly higher performance from their storag...
Shared memory systems, such as SMP and ccNUMA topologies, simplify programming and administration. ...
This work focuses on scheduling of MPI jobs when executing in shared-memory multiprocessors (SMPs). ...
Large-scale shared-memory multiprocessors such as the BBN Butterfly and IBM RP3 introduce a new leve...
In this paper, we evaluate the overheads of virtualization in commercial multicore architectures wit...
Virtual memory provides the illusion of unlimited memory by automating the transfer of data between ...
This thesis describes the design and implementation of effective memory management for multi-threaded ap...
The thesis investigates the Message Passing Interface (MPI) support for shared memory programming on...
Parallel machines with an extremely large number of processors (at least tens of thousands processor...