We are currently involved in research to enable PVM to take advantage of shared networks of workstations (NOWs) more effectively. In such a computing environment, it is important to utilize workstations unobtrusively and recover from machine failures. Towards this goal, we have enhanced PVM with transparent task migration, checkpointing, and global scheduling. These enhancements are part of the MIST project, which takes an open systems approach to developing a cohesive, distributed parallel computing environment. This open systems approach promotes plug-and-play integration of independently developed modules, such as Condor, DQS, AVS, Prospero, XPVM, PIOUS, and Ptools. Transparent task migration, in conjunction with a global scheduler, faci...
This article presents mEDA-2, an extension to PVM which provides Virtual Shared Memory, VSM, for int...
This dissertation describes the design, implementation, and performance of two mechanisms that addre...
One of the more bothersome aspects of developing a parallel program is that of monitoring the behavi...
Parallel Virtual Machine (PVM) is a widely-used software system that allows a heterogeneous set of p...
Parallel programming on clusters of workstations is increasingly attractive, but dynamic load balanc...
The paper describes a parallel program checkpointing mechanism and its potential application in Grid...
This paper presents a source-level software system, PMT, which performs task migrations for long-ru...
Many scientific problems benefit from computations that are parallel at a coarse grain. Collections ...
There is a growing trend toward distributed computing -- writing programs that run across multiple n...
Adaptive load distribution is necessary for parallel applications to co-exist effectively with other ...
Process migration is the ability to transfer a process from one machine to another. It is a useful f...
This paper presents the design and preliminary implementation of MpPVM, a software system that suppo...
Transparent hypervisor-level checkpoint-restart mechanisms for virtual clusters (VCs) or clusters of...
This paper introduces the mig framework: an Open MPI extension to transparently support the migratio...
Checkpointing of parallel applications can be used as the core technology to provide process migrati...