Abstract. This paper presents Locality-Aware Two-Phase (LATP) I/O, an opti-mization of the Two-Phase collective I/O technique from ROMIO, the most pop-ular MPI-IO implementation. In order to increase the locality of the file accesses, LATP employs the Linear Assignment Problem (LAP) for finding an optimal dis-tribution of data to processes, an aspect that is not considered in the original tech-nique. This assignment is based on the local data that each process stores and has as main purpose the reduction of the number of communication involved in the I/O collective operation and, therefore, the improvement of the global execution time. Compared with Two-Phase I/O, LATP I/O obtains important improvements in most of the considered scenarios.
The I/O access patterns of parallel programs often consist of accesses to a large number of small, n...
New le systems are critical to obtain good I/O performance on large multiprocessors. Several researc...
Many scientific applications are I/O intensive and have tremendous I/O requirements, including check...
This paper presents Locality-Aware Two-Phase (LATP) I/O, an optimization of the Two-Phase collective...
This work presents an optimization of MPI communications, called Dynamic-CoMPI, which uses two techn...
This paper presents Two-Phase Compressed I/O (TPC I/O,) an optimization of the Two-Phase collective ...
Two-phase I/O is a well-known strategy for implementing collective MPI-IO functions. It redistribute...
Abstract—MPI collective I/O is a widely used I/O method that helps data-intensive scientific applica...
This paper introduces a new concept called Multi-Collective I/O (MCIO) that extends conventional col...
The I/O access patterns of parallel programs often consist of accesses to a large number of small, n...
Abstract—I/O performance is vital for most HPC applications especially those that generate a vast am...
Abstract—The well-known gap between relative CPU speeds and storage bandwidth results in the need fo...
Abstract. In this paper we present the design, implementation and evaluation of a runtime system bas...
Currently, most scientific applications based on MPI adopt a compute-centric architecture. Needed da...
The increasing number of cores per node has propelled the performance of leadershipscale systems fro...
The I/O access patterns of parallel programs often consist of accesses to a large number of small, n...
New le systems are critical to obtain good I/O performance on large multiprocessors. Several researc...
Many scientific applications are I/O intensive and have tremendous I/O requirements, including check...
This paper presents Locality-Aware Two-Phase (LATP) I/O, an optimization of the Two-Phase collective...
This work presents an optimization of MPI communications, called Dynamic-CoMPI, which uses two techn...
This paper presents Two-Phase Compressed I/O (TPC I/O,) an optimization of the Two-Phase collective ...
Two-phase I/O is a well-known strategy for implementing collective MPI-IO functions. It redistribute...
Abstract—MPI collective I/O is a widely used I/O method that helps data-intensive scientific applica...
This paper introduces a new concept called Multi-Collective I/O (MCIO) that extends conventional col...
The I/O access patterns of parallel programs often consist of accesses to a large number of small, n...
Abstract—I/O performance is vital for most HPC applications especially those that generate a vast am...
Abstract—The well-known gap between relative CPU speeds and storage bandwidth results in the need fo...
Abstract. In this paper we present the design, implementation and evaluation of a runtime system bas...
Currently, most scientific applications based on MPI adopt a compute-centric architecture. Needed da...
The increasing number of cores per node has propelled the performance of leadershipscale systems fro...
The I/O access patterns of parallel programs often consist of accesses to a large number of small, n...
New le systems are critical to obtain good I/O performance on large multiprocessors. Several researc...
Many scientific applications are I/O intensive and have tremendous I/O requirements, including check...