Fig. 1. Network traffic resulting from two different runs of the parallel simulation pF3D. This simulation models laser plasma interaction inside of a hohlraum chamber by decomposing the domain into a set of blocks (left). Depending on how data blocks are mapped to processor cores (middle), different communication patterns occur. When staggering data placement (bottom right) we observe significantly more balanced communication compared to a default mapping similar to how the domain is decomposed (top right). Abstract—The performance of massively parallel applications is often heavily impacted by the cost of communication among compute nodes. However, determining how to best use the network is a formidable task, made challenging by the ever ...
Overlapping communication and computation has been devised as an attractive technique to alleviate t...
Abstract—The Extreme-scale Simulator (xSim) is a recently developed performance investigation toolki...
Torus networks are an attractive topology in supercomputing, balancing the tradeoff between network ...
Abstract—Understanding the interactions between a parallel application and the interconnection netwo...
Network analysis software relies on graph layout algorithms to enable users to visually explore netw...
We are exploring the development and application of information visualization techniques for the ana...
The overall efficiency of an extreme-scale supercomputer largely relies on the performance of its ne...
speedup As the scale of parallel machine grows, communication network is playing more important role...
Executions of modern parallel programs often yield complex communications among compute nodes of lar...
Current and future supercomputers have tens of thousands of compute nodes interconnected with high-d...
In order to be able to develop robust and effective parallel applications and algorithms, one should...
The performance of parallel and distributed applications is highly dependent on the characteristics ...
Simulation provides a flexible and valuable method to study the behavior of information propagation ...
In the early years of parallel computing research, significant theoretical studies were done on inte...
Networks based on the High Performance Parallel Interface (HIPPI) will become the norm at LANL. The ...
Overlapping communication and computation has been devised as an attractive technique to alleviate t...
Abstract—The Extreme-scale Simulator (xSim) is a recently developed performance investigation toolki...
Torus networks are an attractive topology in supercomputing, balancing the tradeoff between network ...
Abstract—Understanding the interactions between a parallel application and the interconnection netwo...
Network analysis software relies on graph layout algorithms to enable users to visually explore netw...
We are exploring the development and application of information visualization techniques for the ana...
The overall efficiency of an extreme-scale supercomputer largely relies on the performance of its ne...
speedup As the scale of parallel machine grows, communication network is playing more important role...
Executions of modern parallel programs often yield complex communications among compute nodes of lar...
Current and future supercomputers have tens of thousands of compute nodes interconnected with high-d...
In order to be able to develop robust and effective parallel applications and algorithms, one should...
The performance of parallel and distributed applications is highly dependent on the characteristics ...
Simulation provides a flexible and valuable method to study the behavior of information propagation ...
In the early years of parallel computing research, significant theoretical studies were done on inte...
Networks based on the High Performance Parallel Interface (HIPPI) will become the norm at LANL. The ...
Overlapping communication and computation has been devised as an attractive technique to alleviate t...
Abstract—The Extreme-scale Simulator (xSim) is a recently developed performance investigation toolki...
Torus networks are an attractive topology in supercomputing, balancing the tradeoff between network ...