This paper presents Locality-Aware Two-Phase (LATP) I/O, an optimization of the Two-Phase collective I/O technique from ROMIO, the most popular MPI-IO implementation. In order to increase the locality of the file accesses, LATP employs the Linear Assignment Problem (LAP) for finding an optimal distribution of data to processes, an aspect that is not considered in the original technique. This assignment is based on the local data that each process stores and has as main purpose the reduction of the number of communication involved in the I/O collective operation and, therefore, the improvement of the global execution time. Compared with Two-Phase I/O, LATP I/O obtains important improvements in most of the considered scenarios.</p
Abstract. In this paper we present the design, implementation and evaluation of a runtime system bas...
The increasing number of cores per node has propelled the performance of leadershipscale systems fro...
Many scientific applications are I/O intensive and have tremendous I/O requirements, including check...
This paper presents Locality-Aware Two-Phase (LATP) I/O, an optimization of the Two-Phase collective...
Abstract. This paper presents Locality-Aware Two-Phase (LATP) I/O, an opti-mization of the Two-Phase...
This work presents an optimization of MPI communications, called Dynamic-CoMPI, which uses two techn...
Two-phase I/O is a well-known strategy for implementing collective MPI-IO functions. It redistribute...
This paper presents Two-Phase Compressed I/O (TPC I/O,) an optimization of the Two-Phase collective ...
Abstract—MPI collective I/O is a widely used I/O method that helps data-intensive scientific applica...
The I/O access patterns of parallel programs often consist of accesses to a large number of small, n...
This paper introduces a new concept called Multi-Collective I/O (MCIO) that extends conventional col...
Abstract—The well-known gap between relative CPU speeds and storage bandwidth results in the need fo...
Abstract—I/O performance is vital for most HPC applications especially those that generate a vast am...
Currently, most scientific applications based on MPI adopt a compute-centric architecture. Needed da...
The I/O access patterns of parallel programs often consist of accesses to a large number of small, n...
Abstract. In this paper we present the design, implementation and evaluation of a runtime system bas...
The increasing number of cores per node has propelled the performance of leadershipscale systems fro...
Many scientific applications are I/O intensive and have tremendous I/O requirements, including check...
This paper presents Locality-Aware Two-Phase (LATP) I/O, an optimization of the Two-Phase collective...
Abstract. This paper presents Locality-Aware Two-Phase (LATP) I/O, an opti-mization of the Two-Phase...
This work presents an optimization of MPI communications, called Dynamic-CoMPI, which uses two techn...
Two-phase I/O is a well-known strategy for implementing collective MPI-IO functions. It redistribute...
This paper presents Two-Phase Compressed I/O (TPC I/O,) an optimization of the Two-Phase collective ...
Abstract—MPI collective I/O is a widely used I/O method that helps data-intensive scientific applica...
The I/O access patterns of parallel programs often consist of accesses to a large number of small, n...
This paper introduces a new concept called Multi-Collective I/O (MCIO) that extends conventional col...
Abstract—The well-known gap between relative CPU speeds and storage bandwidth results in the need fo...
Abstract—I/O performance is vital for most HPC applications especially those that generate a vast am...
Currently, most scientific applications based on MPI adopt a compute-centric architecture. Needed da...
The I/O access patterns of parallel programs often consist of accesses to a large number of small, n...
Abstract. In this paper we present the design, implementation and evaluation of a runtime system bas...
The increasing number of cores per node has propelled the performance of leadershipscale systems fro...
Many scientific applications are I/O intensive and have tremendous I/O requirements, including check...