In this paper we describe the architecture of PerfMC, a performance monitoring system for clusters of workstations; a prototype implementation of the architecture is also presented. PerfMC is driven by an XML configuration file, and uses the Simple Network Management Protocol (SNMP) to collect statistics from each networked equipment. The collected data are maintained on the local disk of the monitoring station in a compact format, and various graphical and statistical analyses can be performed off-line. The monitoring tool embeds an HTTP server which is able to generate various types of graphs from the collected data. Moreover the HTTP server can generate arbitrary XML pages by dynamically applying XSLT stylesheets to an internal XML repre...
Monitoring is at the heart of cluster management. Instrumentation data is used to schedule tasks, lo...
Global scientific collaborations, such as ATLAS, continue to push the network requirements envelope....
This paper introduces an infrastructure for efficiently collecting performance profiles from paralle...
In this paper we describe the architecture of PerfMC, a performance monitoring system for clusters o...
Large scale computer clusters have during the last years become dominant for making computations in ...
Abstract. We present PerfMiner, a system for the transparent collection, storage and presentation of...
The CMS experiment's online cluster consists of 2300 computers and 170 switches or routers operating...
This research describes Fountain, a suite of programs used to monitor the resources of a cluster. A ...
This research describes Fountain, a suite of software used to monitor the resources of a cluster. A ...
Project (M.S., Computer Science) -- California State University, Sacramento, 2011.The basic goal of ...
This paper reports on the design and implementation of the HPC performance monitoring system deploye...
The use of a cluster for distributed performance analy-sis of parallel trace data is discussed. We p...
The ATLAS TDAQ Network consists of three separate networks spanning four levels of the experimental ...
The WLCG infrastructure moved from a very rigid network topology, based on the MONARC model, to a mo...
perfSONAR is a web services-based infrastructure for collecting and publishing network performance ...
Monitoring is at the heart of cluster management. Instrumentation data is used to schedule tasks, lo...
Global scientific collaborations, such as ATLAS, continue to push the network requirements envelope....
This paper introduces an infrastructure for efficiently collecting performance profiles from paralle...
In this paper we describe the architecture of PerfMC, a performance monitoring system for clusters o...
Large scale computer clusters have during the last years become dominant for making computations in ...
Abstract. We present PerfMiner, a system for the transparent collection, storage and presentation of...
The CMS experiment's online cluster consists of 2300 computers and 170 switches or routers operating...
This research describes Fountain, a suite of programs used to monitor the resources of a cluster. A ...
This research describes Fountain, a suite of software used to monitor the resources of a cluster. A ...
Project (M.S., Computer Science) -- California State University, Sacramento, 2011.The basic goal of ...
This paper reports on the design and implementation of the HPC performance monitoring system deploye...
The use of a cluster for distributed performance analy-sis of parallel trace data is discussed. We p...
The ATLAS TDAQ Network consists of three separate networks spanning four levels of the experimental ...
The WLCG infrastructure moved from a very rigid network topology, based on the MONARC model, to a mo...
perfSONAR is a web services-based infrastructure for collecting and publishing network performance ...
Monitoring is at the heart of cluster management. Instrumentation data is used to schedule tasks, lo...
Global scientific collaborations, such as ATLAS, continue to push the network requirements envelope....
This paper introduces an infrastructure for efficiently collecting performance profiles from paralle...