There is a balance point in the design of a high-performance computing (HPC) system at which the contribution of the mass storage (I/O) infrastructure to performance is neither too large nor too small. That balance point has commonly been identified heuristically. Our contribution quantifies the balance point by examining the relative costs and impact of compute resources versus I/O resources. If storage is a bottleneck, improving the I/O capability can raise system utilization, increasing throughput. The balance point is where the cost of increasing throughput by adding I/O capability is the same as the cost of doing so by adding nodes.

Figure 1. Heuristics for System Balance: a) Heuristics, b) 1GB/s per TB Memory capacity is the key determinan...
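The balance condition stated in the abstract can be illustrated with a minimal sketch. It is not the authors' model: it assumes throughput is simply the node count times utilization, and every name and number below (node_cost, io_unit_cost, utilization_gain_per_io_unit) is a hypothetical parameter introduced for illustration.

```python
import math


def marginal_cost_via_nodes(node_cost, utilization):
    """Cost per unit of added throughput when buying one more node.

    Assumption: a node contributes `utilization` units of throughput,
    because it spends the rest of its time waiting on I/O.
    """
    return node_cost / utilization


def marginal_cost_via_io(io_unit_cost, nodes, utilization_gain_per_io_unit):
    """Cost per unit of added throughput when buying one more unit of I/O.

    Assumption: one I/O unit raises the utilization of all `nodes` by
    `utilization_gain_per_io_unit`.
    """
    return io_unit_cost / (nodes * utilization_gain_per_io_unit)


def cheaper_upgrade(node_cost, utilization, io_unit_cost, nodes,
                    utilization_gain_per_io_unit):
    """Return 'io', 'nodes', or 'balanced' for the cheaper way to add throughput."""
    via_nodes = marginal_cost_via_nodes(node_cost, utilization)
    via_io = marginal_cost_via_io(io_unit_cost, nodes,
                                  utilization_gain_per_io_unit)
    if math.isclose(via_nodes, via_io, rel_tol=1e-9):
        return "balanced"
    return "io" if via_io < via_nodes else "nodes"
```

Under these assumptions, a system is at the balance point when the function returns 'balanced'; otherwise the result names the cheaper upgrade. For example, with hypothetical values node_cost=10000, utilization=0.5, io_unit_cost=40000, nodes=1000, both marginal costs equal 20000 per unit of throughput when one I/O unit raises utilization by 0.002.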
Supercomputer I/O loads are often dominated by writes. HPC (High Performance Computing) fil...
• Supercomputer (n): A computer which turns a CPU-bound problem into an I/O-bound problem.
• As hig...
Load balance is critical for performance in large parallel applications. An imbalance on today’s fa...
The issues to be addressed here are those of "balance" in machine architecture. By this, we mean how...
As parallel systems move into the production scientific-computing world, the emphasis will be on cos...
Scientific applications are diverse in terms of their resource requirements, and tend to vary signifi...
The computing power of high-performance computing (HPC) systems is increasing with a rapid growth in...
One of the critical factors that affect the performance of many applications is load imbalance. App...
High performance computing (HPC) is changing the way science is performed in the 21st Century; exper...
Two key changes are driving an immediate need for deeper understanding of I/O workloads in high-perf...
Though input/output (I/O) from mass storage continues to be a bottleneck in current generation distr...
The compute capacity growth in high performance computing (HPC) systems is outperforming improvement...
Most recently, an important metric called energy proportionality has been presented as a guideline for ener...
Disk storage subsystems have not kept pace with processors. Processor performance has been i...