International audience. Grid reliability remains an order of magnitude below that of clusters on production infrastructures. This work aims at improving grid application performance by improving the job submission system. A stochastic model capturing the behavior of a complex grid workload management system is proposed. To instantiate the model, detailed statistics are extracted from dense grid activity traces. The model is exploited in a simple job resubmission strategy. It provides quantitative inputs to improve job submission performance and enables quantifying the impact of faults and outliers on grid operations.
International audience. The ever increasing scale and complexity of large computational systems ask fo...
International audience. Production grids have a potential for parallel execution of a very large numbe...
International audience. In this paper, we study grid jobs latency. Together with outliers, latency hig...
International audience. It is commonly observed that production grids are inherently unreliable. The a...
Research Report I3S laboratory, number I3S/RR-2009-17-FR, Sophia Antipolis. Production-grid users expe...
International audience. Production-grid users experience many system faults as well as high and variab...
In this paper, we examine how the execution context of grid jobs can help to refine submission strat...
International audience. In this paper, we study grid job submission latencies. The latency highly impa...
International audience. Despite extensive research focused on enabling QoS for grid users through econ...
Production grids are complex and highly variable systems whose behavior is not well understood and d...
Thousands of scientific users witness every day inherent instabilities and bottlenecks of large-scal...
Scientific communities are using a growing number of distributed systems, from local batch systems, ...
Workshop theme: Workload characterization and modelling. With Grids, we are able to share computin...
Independent observations and everyday user experience indicate that performance and reliability of l...