We present a monitoring system for large-scale parallel and distributed computing environments that allows accuracy to be traded off in a tunable fashion to gain scalability without compromising fidelity. The approach relies on classifying each gathered monitoring metric according to individual needs and on aggregating messages that contain classes of individual monitoring metrics using a tree-based overlay network. The MRNet-based prototype significantly reduces the amount of gathered and stored monitoring data, e.g., by a factor of ≈56 in comparison to the Ganglia distributed monitoring system. A simple scaling study reveals, however, that further effort is needed to reduce the amount of data in order to monitor future-generation extreme-scale...
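The classify-then-aggregate idea in the abstract above can be illustrated with a minimal sketch. It is not the MRNet-based prototype: the class boundaries, metric names, fan-out, and every function name below are assumptions made for illustration only. Each leaf maps a raw metric value to a class (bin) index, and inner nodes of the tree merge child messages by summing per-class counts, so the root receives a small histogram per metric instead of one value per node.

```python
from bisect import bisect_right
from collections import Counter

# Hypothetical class boundaries per metric (illustrative, not from the paper).
BOUNDARIES = {
    "cpu_load": [0.25, 0.50, 0.75],  # four classes: 0..3
    "mem_used": [0.50, 0.90],        # three classes: 0..2
}

def classify(metric, value):
    """Map a raw value to the index of the class (bin) it falls into."""
    return bisect_right(BOUNDARIES[metric], value)

def leaf_message(samples):
    """Build a leaf message: per metric, a count of 1 in the observed class."""
    return {m: Counter({classify(m, v): 1}) for m, v in samples.items()}

def aggregate(messages):
    """Merge child messages at an inner tree node by summing class counts."""
    merged = {}
    for msg in messages:
        for metric, counts in msg.items():
            merged.setdefault(metric, Counter()).update(counts)
    return merged

def reduce_tree(leaves, fanout=2):
    """Aggregate level by level until a single root message remains."""
    level = list(leaves)
    while len(level) > 1:
        level = [aggregate(level[i:i + fanout])
                 for i in range(0, len(level), fanout)]
    return level[0]

if __name__ == "__main__":
    readings = [(0.1, 0.2), (0.3, 0.95), (0.8, 0.6), (0.26, 0.1)]
    leaves = [leaf_message({"cpu_load": c, "mem_used": m}) for c, m in readings]
    print(reduce_tree(leaves))
```

In this sketch, coarser boundaries yield fewer classes and smaller messages at the cost of accuracy, which is one simple way to picture the tunable trade-off the abstract describes.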
This research describes Fountain, a suite of programs used to monitor the resources of a cluster. A ...
Virtual Networks are characterised as highly dynamic network environments, where topologies and node...
Data centers supporting cloud-based services are characterized by a huge number of hardware and soft...
In order to assess the overall service quality in real time, the performance metrics of a distribute...
Scalable system monitoring is a fundamental abstraction for large-scale networked systems. The g...
In order to assess service quality of a networked application (such as a streaming session), distrib...
In this paper, we present a structure for monitoring a large set of computational clusters. We illus...
Large-scale networked systems, such as the Internet and server clusters, are omnipresent today. They...
The focus of this thesis is continuous real-time monitoring, which is essential for the realization ...
Monitoring systems give network administrators a better view and understanding of their networks. Am...
Monitoring systems are necessary for the management of anything beyond the smallest networks of comp...
Academic dissertation which, with the permission of Kungl Tekniska högskolan, is presented for public examination...
Real-time monitoring is increasingly becoming important in various scenes of large scale, multi-sit...
Large-scale computer clusters have in recent years become dominant for performing computations in ...
This research describes Fountain, a suite of software used to monitor the resources of a cluster. A ...