Combining the strengths of MPIProf and IOT, an efficient and systematic method is devised for I/O characterization at the per-job, per-rank, per-file and per-call levels of HPC programs running on the NASA Advanced Supercomputing Center. This method is applied to answer four I/O questions in this paper. A total of 13 MPI programs and 15 cases, ranging from 24 to 5968 ranks, are analyzed to establish the I/O landscape from answers to the four questions. Four of the 13 programs use MPI I/O and the behavior of their collective writes depends on the specific implementation of the MPI library used. The SGI MPT library, the prevailing MPI library for our systems, was found to gather small writes from a large number of ranks to perform larger writ...
Parallelisation, serial optimisation, compiler tuning, and many more techniques are used to optimise...
The advance in both computation and data storage size in High Performance Computing (HPC) has not ...
Efficient usage of file systems poses a major challenge for highly scalable parallel applications. T...
The 2014 TOP500 supercomputer list includes over 40 deployed petascale systems, and the high perform...
With the exponential growth of high-fidelity sensor and simulated data, the scientific community is ...
High performance computing (HPC) is changing the way science is performed in the 21st Century; exper...
Benchmarking and analyzing I/O performance across high performance computing (HPC) platforms is nece...
In this paper we study the performance of the Lustre file system using five scientific and engineeri...
The computing power of high-performance computing (HPC) systems is increasing with a rapid growth in...
Input/Output (I/O) operations can represent a significant proportion of the run-time of parallel sci...
Input/Output (I/O) operations can represent a significant proportion of the run-time of parallel sci...
Many scientific applications have intense computational and I/O requirements. Although multiprocesso...
Multiprocessors have permitted astounding increases in computational performance, but many cannot me...
Due to the explosive growth in the size of scientific data sets, data-intensive computing is an emer...
Input/output (I/O) operations can represent a significant proportion of the run-time when large scie...
Parallelisation, serial optimisation, compiler tuning, and many more techniques are used to optimise...
The advance in both computation and data storage size in High Performance Computing (HPC) has not ...
Efficient usage of file systems poses a major challenge for highly scalable parallel applications. T...
The 2014 TOP500 supercomputer list includes over 40 deployed petascale systems, and the high perform...
With the exponential growth of high-fidelity sensor and simulated data, the scientific community is ...
High performance computing (HPC) is changing the way science is performed in the 21st Century; exper...
Benchmarking and analyzing I/O performance across high performance computing (HPC) platforms is nece...
In this paper we study the performance of the Lustre file system using five scientific and engineeri...
The computing power of high-performance computing (HPC) systems is increasing with a rapid growth in...
Input/Output (I/O) operations can represent a significant proportion of the run-time of parallel sci...
Input/Output (I/O) operations can represent a significant proportion of the run-time of parallel sci...
Many scientific applications have intense computational and I/O requirements. Although multiprocesso...
Multiprocessors have permitted astounding increases in computational performance, but many cannot me...
Due to the explosive growth in the size of scientific data sets, data-intensive computing is an emer...
Input/output (I/O) operations can represent a significant proportion of the run-time when large scie...
Parallelisation, serial optimisation, compiler tuning, and many more techniques are used to optimise...
The advance in both computation and data storage size in High Performance Computing (HPC) has not ...
Efficient usage of file systems poses a major challenge for highly scalable parallel applications. T...