The growing amount of resources available from Clouds, Grids, and HPC systems, for example, has led to an enormous number of executed programs (jobs). Today, most of these jobs run largely unobserved, so unusual behavior, non-optimal resource usage, and silent faults are not systematically searched for and analyzed. Job-centric monitoring enables permanent job observation and thus the analysis of monitoring data. In this paper, we show how statistical functions can be used to analyze job-centric monitoring data and how these methods compare to more complex analysis methods. Additionally, we demonstrate the usefulness of job-centric monitoring based on practical experience.
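As a rough illustration of what analyzing job-centric monitoring data with simple statistical functions could look like, the following minimal sketch reduces each job's samples to summary statistics and flags jobs whose mean lies far from the mean over all jobs. It is not the method of the paper: the data layout (a dictionary mapping a job ID to a list of metric samples such as CPU utilization), the function names `summarize_job` and `flag_unusual_jobs`, and the threshold of two standard deviations are all assumptions made for this example.

```python
from statistics import mean, pstdev


def summarize_job(samples):
    """Reduce one job's metric samples to a few summary statistics."""
    return {
        "mean": mean(samples),
        "stdev": pstdev(samples),
        "min": min(samples),
        "max": max(samples),
    }


def flag_unusual_jobs(jobs, threshold=2.0):
    """Return the sorted IDs of jobs whose mean metric value lies more than
    `threshold` population standard deviations away from the mean over all
    jobs (a plain z-score test; the threshold is an assumed default)."""
    job_means = {job_id: mean(samples) for job_id, samples in jobs.items()}
    values = list(job_means.values())
    overall_mean = mean(values)
    overall_std = pstdev(values)
    if overall_std == 0:
        # All jobs behave identically, so nothing stands out.
        return []
    return sorted(
        job_id
        for job_id, value in job_means.items()
        if abs(value - overall_mean) / overall_std > threshold
    )


if __name__ == "__main__":
    # Hypothetical CPU-utilization samples (fraction of one core) per job.
    jobs = {f"job{i}": [0.9, 0.9, 0.9] for i in range(9)}
    jobs["job9"] = [0.1, 0.1, 0.1]  # a job that is mostly idle
    print(flag_unusual_jobs(jobs))
```

A z-score on per-job means is only one of many possible statistical functions; a median-based measure would be less sensitive to the outliers it is trying to find.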
Every day, supercomputers execute 1000s of jobs with different characteristics. Data centers monitor...
In this paper we present the concept of a scalable job-centric monitoring infrastructure. The overall...
Recent developments in high energy physics (HEP) including multi-core jobs and multi-core pilots req...
Processing of large data sets with high throughput is one of the major focuses of Grid computing toda...
HPC applications with suboptimal I/O behavior interfere with well-behaving applications and lead to...
In the HPC community, the System Utilization metric makes it possible to determine if the res...
The grid is emerging as a great computational resource but its dynamic behavior makes the Grid envi...
Internship report for the Magistère d'informatique, prepared in parallel with the first year of the Master d'i...
With the introduction of federated data access to the workflows of WLCG, it is becoming increasingly i...
Monitoring of the large-scale data processing of the ATLAS experiment includes monitoring of product...
The ever-increasing scale and complexity of large computational systems ask fo...
Grid reliability remains an order of magnitude below clusters on production i...
Large high-performance computing systems are built with an increasing number of components with more CP...