A complex software environment such as the ALICE Computing Grid infrastructure requires permanent control and management for the large set of services involved. Automating control procedures reduces the human interaction with the various components of the system and yields better availability of the overall system. In this paper we will present how we used the MonALISA framework to gather, store and display the relevant metrics in the entire system from central and remote site services. We will also show the automatic local and global procedures that are triggered by the monitored values. Decision-taking agents are used to restart remote services, alert the operators in case of problems that cannot be automatically solved, submit production...
As organizations begin to deploy large computational grids, it has become apparent that systems for ...
Abstract—The centralised management of distributed comput-ing infrastructures presents a number of c...
The centralized system approach for computer and telecommunication network management has been prese...
Abstract. A complex software environment such as the ALICE Computing Grid infrastruc ture requires p...
High-Energy Physics experiments like ALICE at LHC require petabytes of storage and thousand of CP...
We are developing a general purpose monitoring system for the ALICE experiment, based on the MonALIS...
The successful administration of a global Data Grid system requires collecting and storing relevant ...
The MonALISA (Monitoring Agents in A Large Integrated Services Architecture) system provides a distr...
The MonALISA (Monitoring Agents in A Large Integrated Services Architecture) system provides a distr...
The MonALISA (Monitoring Agents in A Large Integrated Services Architecture) framework provides a se...
As organizations begin to deploy large computational grids, it has become apparent that systems for ...
Large distributed systems, such as computational grids,require a large amount of monitoring data be ...
The MonALISA (Monitoring Agents using a Large Integrated Services Architecture) framework provides a...
Performance is a critical issue in a production system accommodating hundreds of analysis users. Com...
As organizations begin to deploy large computational grids, it has become apparent that systems for ...
As organizations begin to deploy large computational grids, it has become apparent that systems for ...
Abstract—The centralised management of distributed comput-ing infrastructures presents a number of c...
The centralized system approach for computer and telecommunication network management has been prese...
Abstract. A complex software environment such as the ALICE Computing Grid infrastruc ture requires p...
High-Energy Physics experiments like ALICE at LHC require petabytes of storage and thousand of CP...
We are developing a general purpose monitoring system for the ALICE experiment, based on the MonALIS...
The successful administration of a global Data Grid system requires collecting and storing relevant ...
The MonALISA (Monitoring Agents in A Large Integrated Services Architecture) system provides a distr...
The MonALISA (Monitoring Agents in A Large Integrated Services Architecture) system provides a distr...
The MonALISA (Monitoring Agents in A Large Integrated Services Architecture) framework provides a se...
As organizations begin to deploy large computational grids, it has become apparent that systems for ...
Large distributed systems, such as computational grids,require a large amount of monitoring data be ...
The MonALISA (Monitoring Agents using a Large Integrated Services Architecture) framework provides a...
Performance is a critical issue in a production system accommodating hundreds of analysis users. Com...
As organizations begin to deploy large computational grids, it has become apparent that systems for ...
As organizations begin to deploy large computational grids, it has become apparent that systems for ...
Abstract—The centralised management of distributed comput-ing infrastructures presents a number of c...
The centralized system approach for computer and telecommunication network management has been prese...