This paper introduces the mig framework: an Open MPI extension to transparently support the migration of application processes, over different nodes of a distributed High-Performance Computing (HPC) system. The framework provides mechanism on top of which suitable resource managers can implement policies to react to hardware faults, address performance variability, improve resource utilization, perform a fine-grained load balancing and power thermal management. Compared to other state-of-the-art approaches, the mig framework does not require changes in the application code. Moreover, it is highly maintainable, since it is mainly a self-contained solution that has required a very few changes in other already existing Open MPI frameworks. Ex...
This paper presents the design and preliminary implementation of MpPVM, a software system that suppo...
Abstract—There is a clear trend towards using cloud re-sources in the scientific or the HPC communit...
We are currently involved in research to enable PVM to take advantage of shared networks of workstat...
This paper introduces the mig framework: an Open MPI extension to transparently support the migratio...
This is a post-peer-review, pre-copyedit version of an article published in Journal of Supercomputin...
[Abstract] Process migration provides many benefits for parallel environments including dynamic load...
A lot of research has been done on fault-tolerance for MPI applications, some on checkpoint/restart,...
Scientists use advanced computing techniques to assist in answering the complex questions at the for...
[Abstract] Execution times of large-scale computational science and engineering parallel application...
Thesis (Ph.D.) - Indiana University, Computer Sciences, 2010Scientists use advanced computing techni...
This report describes an implementation of MPI-1 on the GENESIS cluster operating system and compare...
The first version of MPI (Message Passing Interface) was released in 1994. At that time, scientific ...
Process migration is the ability to transfer a process from one machine to another. It is a useful f...
Maintaining a high rate of productivity, in terms of completed jobs per unit of time, in High-Perfor...
RESUMEN: El presente trabajo fin de grado, tiene como objetivo principal analizar la importancia de ...
This paper presents the design and preliminary implementation of MpPVM, a software system that suppo...
Abstract—There is a clear trend towards using cloud re-sources in the scientific or the HPC communit...
We are currently involved in research to enable PVM to take advantage of shared networks of workstat...
This paper introduces the mig framework: an Open MPI extension to transparently support the migratio...
This is a post-peer-review, pre-copyedit version of an article published in Journal of Supercomputin...
[Abstract] Process migration provides many benefits for parallel environments including dynamic load...
A lot of research has been done on fault-tolerance for MPI applications, some on checkpoint/restart,...
Scientists use advanced computing techniques to assist in answering the complex questions at the for...
[Abstract] Execution times of large-scale computational science and engineering parallel application...
Thesis (Ph.D.) - Indiana University, Computer Sciences, 2010Scientists use advanced computing techni...
This report describes an implementation of MPI-1 on the GENESIS cluster operating system and compare...
The first version of MPI (Message Passing Interface) was released in 1994. At that time, scientific ...
Process migration is the ability to transfer a process from one machine to another. It is a useful f...
Maintaining a high rate of productivity, in terms of completed jobs per unit of time, in High-Perfor...
RESUMEN: El presente trabajo fin de grado, tiene como objetivo principal analizar la importancia de ...
This paper presents the design and preliminary implementation of MpPVM, a software system that suppo...
Abstract—There is a clear trend towards using cloud re-sources in the scientific or the HPC communit...
We are currently involved in research to enable PVM to take advantage of shared networks of workstat...