International audienceThis chapter describes the P2P-MPI project, a software framework aimed at the development of message-passing programs in large scale distributed networks of computers. Our goal is to provide a light-weight, self-contained software package that requires minimum effort to use and maintain. P2P-MPI relies on three features to reach this goal: i) its installation and use does not require administrator privileges, ii) available resources are discovered and selected for a computation without intervention from the user, iii) program executions can be fault-tolerant on user demand, in a completely transparent fashion (no checkpoint server to configure). P2P-MPI is typically suited for organizations having spare individual comp...
We present a unified fault-tolerance framework for task-parallel message-passing applications to mit...
Application developers are used to a homogeneous, reliable and easily manageable platform on which t...
Java has many features of interest to developers of large-scale parallel applications. At the same t...
International audienceWe present in this paper a study on fault management in a grid middleware. The...
Cette thèse propose l'intergiciel P2P-MPI pour faciliter l'utilisation des grilles de calcul.P2P-MPI...
We propose lightweight middleware solutions that facilitate and simplify the execution of failure-re...
In this paper we sketch out a proposed reference implementation for message passing in Java (MPJ), a...
The P2P-MPI system is a peer-to-peer based system that utilises network-enabled computers connected ...
Recently, there has been a lot of interest in using Java for parallel programming. Efforts have been...
Resilience and fault tolerance are challenging tasks in the field of high performance computing (HPC...
Fault tolerance in parallel systems has traditionally been achieved through a combination of redunda...
We sketch a proposed reference implementation for MPJ, the Java Grande Forum\u27s MPI-like message-p...
The goal of this research was to investigate the potential for employing dynamic, decentralized soft...
The aggregation of typical home computers through a peer-to-peer (P2P) framework over the Internet w...
International audienceHigh Performance computing generally involves some parallel applications to be...
We present a unified fault-tolerance framework for task-parallel message-passing applications to mit...
Application developers are used to a homogeneous, reliable and easily manageable platform on which t...
Java has many features of interest to developers of large-scale parallel applications. At the same t...
International audienceWe present in this paper a study on fault management in a grid middleware. The...
Cette thèse propose l'intergiciel P2P-MPI pour faciliter l'utilisation des grilles de calcul.P2P-MPI...
We propose lightweight middleware solutions that facilitate and simplify the execution of failure-re...
In this paper we sketch out a proposed reference implementation for message passing in Java (MPJ), a...
The P2P-MPI system is a peer-to-peer based system that utilises network-enabled computers connected ...
Recently, there has been a lot of interest in using Java for parallel programming. Efforts have been...
Resilience and fault tolerance are challenging tasks in the field of high performance computing (HPC...
Fault tolerance in parallel systems has traditionally been achieved through a combination of redunda...
We sketch a proposed reference implementation for MPJ, the Java Grande Forum\u27s MPI-like message-p...
The goal of this research was to investigate the potential for employing dynamic, decentralized soft...
The aggregation of typical home computers through a peer-to-peer (P2P) framework over the Internet w...
International audienceHigh Performance computing generally involves some parallel applications to be...
We present a unified fault-tolerance framework for task-parallel message-passing applications to mit...
Application developers are used to a homogeneous, reliable and easily manageable platform on which t...
Java has many features of interest to developers of large-scale parallel applications. At the same t...