The computational needs of many applications outstrip the capabilities of a single compute node. Communication is necessary to employ multiple nodes, but slow communication often limits application performance on multiple nodes. To improve communication performance, developers need tools that enable them to understand how their application’s communication patterns interact with the network, especially when those interactions result in congestion. Since communication performance is difficult to reason about analytically and simulation is costly, measurement-based approaches are needed. This thesis describes a new sampling-based technique to collect information about the path a packet takes and congestion it encounters. Experiments with simul...
The relative performance of different data collection methods in the assessmentofvarious traffic par...
Network performance in tightly-coupled multiprocessors typically degrades rapidly beyond network sat...
This paper looks at adaptive applications that can switch between a small number of different levels...
In order to be able to develop robust and effective parallel applications and algorithms, one should...
Abstract—Network congestion is one of the primary causes of performance degradation, performance var...
Inter-node networks are a key capability of High-Performance Computing (HPC) systems that differenti...
Although one of the key characteristics of High Performance Computing (HPC) infrastructures are thei...
Detecting the points of network congestion is an intriguing research problem, because this informati...
Distributed control protocols routinely have to operate oblivious of dynamic network information for...
Opportunistic networks are a subset of delay tolerant networks where the contacts are unscheduled. S...
Copyright © 2007 Elsevier Ltd All rights reserved.Interconnection networks in current parallel syste...
Abstract—Collecting per-flow aggregates in high-speed links is challenging and usually requires traf...
Large-scale compute clusters are highly affected by performance variability that originates from dif...
Congestion is an important issue in networks and significantly affects network performance. Various ...
Abstract: A congestion avoidance scheme allows a network to operate in the region of low delay and h...
The relative performance of different data collection methods in the assessmentofvarious traffic par...
Network performance in tightly-coupled multiprocessors typically degrades rapidly beyond network sat...
This paper looks at adaptive applications that can switch between a small number of different levels...
In order to be able to develop robust and effective parallel applications and algorithms, one should...
Abstract—Network congestion is one of the primary causes of performance degradation, performance var...
Inter-node networks are a key capability of High-Performance Computing (HPC) systems that differenti...
Although one of the key characteristics of High Performance Computing (HPC) infrastructures are thei...
Detecting the points of network congestion is an intriguing research problem, because this informati...
Distributed control protocols routinely have to operate oblivious of dynamic network information for...
Opportunistic networks are a subset of delay tolerant networks where the contacts are unscheduled. S...
Copyright © 2007 Elsevier Ltd All rights reserved.Interconnection networks in current parallel syste...
Abstract—Collecting per-flow aggregates in high-speed links is challenging and usually requires traf...
Large-scale compute clusters are highly affected by performance variability that originates from dif...
Congestion is an important issue in networks and significantly affects network performance. Various ...
Abstract: A congestion avoidance scheme allows a network to operate in the region of low delay and h...
The relative performance of different data collection methods in the assessmentofvarious traffic par...
Network performance in tightly-coupled multiprocessors typically degrades rapidly beyond network sat...
This paper looks at adaptive applications that can switch between a small number of different levels...