The Job Execution Monitor (JEM) is a job-centric grid job monitoring software developed at the University of Wuppertal and integrated into the pilot-based “PanDA” job brokerage system leveraging physics analysis and Monte Carlo event production for the ATLAS experiment on the Worldwide LHC Computing Grid (WLCG). With JEM, job progress and grid worker node health can be supervised in real time by users, site admins and shift personnel. Imminent error conditions can be detected early and countermeasures can be initiated by the Job’s owner immideatly. Grid site admins can access aggregated data of all monitored jobs to infer the site status and to detect job and Grid worker node misbehaviour. Shifters can use the same aggregated data to quickl...
ATLAS is the largest experiment at the LHC. It generates vast volumes of scientific data accompanied...
GRID environments are privileged targets for computation-intensive problem solving in areas from wea...
Processing ATLAS event data requires a wide variety of auxiliary information from geometry, trigger,...
The Job Execution Monitor (JEM) is a job-centric grid job monitoring software developed at the Unive...
hsnr.de physik.uni-wuppertal.de Experience with generating simulation data of high energy physics ex...
Monitoring of the large-scale data processing of the ATLAS experiment includes monitoring of product...
large scale computing grid LCG (LHC Computing Grid) is meant to process and store the large amount o...
Error handling is a crucial task in an infrastructure as complex as a grid. There are several monito...
International audienceThe ever increasing scale and complexity of large computational systems ask fo...
The ATLAS experiment is a High Energy Physics experiment that utilizes the services of Grid3 now mig...
Abstract The Worldwide LHC Computing Grid (WLCG) includes more than 170 grid and cloud computing ce...
To process large amounts of data produced at the Large Hadron Col-lider at CERN, users typically sub...
Processing of large data sets with high through put is one of the major focus of Grid computing toda...
In a wide-area distributed and heterogeneous grid environment, monitoring plays an important and cru...
In this paper we present an unsupervised learning approach to detect meaningful job traffic patterns...
ATLAS is the largest experiment at the LHC. It generates vast volumes of scientific data accompanied...
GRID environments are privileged targets for computation-intensive problem solving in areas from wea...
Processing ATLAS event data requires a wide variety of auxiliary information from geometry, trigger,...
The Job Execution Monitor (JEM) is a job-centric grid job monitoring software developed at the Unive...
hsnr.de physik.uni-wuppertal.de Experience with generating simulation data of high energy physics ex...
Monitoring of the large-scale data processing of the ATLAS experiment includes monitoring of product...
large scale computing grid LCG (LHC Computing Grid) is meant to process and store the large amount o...
Error handling is a crucial task in an infrastructure as complex as a grid. There are several monito...
International audienceThe ever increasing scale and complexity of large computational systems ask fo...
The ATLAS experiment is a High Energy Physics experiment that utilizes the services of Grid3 now mig...
Abstract The Worldwide LHC Computing Grid (WLCG) includes more than 170 grid and cloud computing ce...
To process large amounts of data produced at the Large Hadron Col-lider at CERN, users typically sub...
Processing of large data sets with high through put is one of the major focus of Grid computing toda...
In a wide-area distributed and heterogeneous grid environment, monitoring plays an important and cru...
In this paper we present an unsupervised learning approach to detect meaningful job traffic patterns...
ATLAS is the largest experiment at the LHC. It generates vast volumes of scientific data accompanied...
GRID environments are privileged targets for computation-intensive problem solving in areas from wea...
Processing ATLAS event data requires a wide variety of auxiliary information from geometry, trigger,...