I3S laboratory Research Report (I3S/RR-2006-35-FR), Sophia Antipolis, France. This paper presents a method to optimize the timeout value of grid computing jobs. It relies on a model of the job execution time that represents the latency of the job management system as a random variable. It also takes into account a proportion of outliers, so that it can model either reliable clusters or production grids where faults cause job loss. Job management systems are first studied under classical latency distributions. Different behaviors emerge, depending on the weight of the distribution's tail and on the proportion of outliers. Experimental results are then shown based on the latency distribution and outlier ratios measured on the...
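The abstract above describes the ingredients of the model (a random latency, a proportion of outliers, and a timeout to tune) without giving formulas. As a rough illustration only, the following Python sketch assumes a cancel-and-resubmit strategy with independent attempts, for which the expected waiting time is E[min(R, t)] / P(R <= t). This strategy, the log-normal latency, and every name and parameter value below are assumptions made for the example, not the report's actual model or measurements.

```python
import math


def expected_latency(timeout, cdf, outlier_ratio, steps=2000):
    """Expected time until a job starts when every attempt is cancelled and
    resubmitted after `timeout` seconds (assumption: independent attempts)."""
    # Probability that a single attempt starts before the timeout.
    # Outliers (lost jobs) are assumed never to start.
    p_success = (1.0 - outlier_ratio) * cdf(timeout)
    if p_success <= 0.0:
        return math.inf
    # E[min(R, timeout)] is the integral of the survival function over
    # [0, timeout]; it is approximated here with the midpoint rule.
    h = timeout / steps
    mean_attempt = sum(
        (1.0 - (1.0 - outlier_ratio) * cdf((i + 0.5) * h)) * h
        for i in range(steps)
    )
    # Standard restart identity: mean attempt duration / success probability.
    return mean_attempt / p_success


def best_timeout(cdf, outlier_ratio, candidates):
    """Return the candidate timeout with the smallest expected latency."""
    return min(candidates, key=lambda t: expected_latency(t, cdf, outlier_ratio))


def lognormal_cdf(t, mu=math.log(300.0), sigma=1.0):
    """Hypothetical heavy-tailed latency: log-normal with a 300 s median."""
    return 0.5 * (1.0 + math.erf((math.log(t) - mu) / (sigma * math.sqrt(2.0))))


# Hypothetical setting: 5% of submitted jobs are lost; timeouts are scanned
# from 1 minute to 1 hour in 1-minute steps.
candidates = list(range(60, 3601, 60))
optimal = best_timeout(lognormal_cdf, 0.05, candidates)
```

Under these assumptions the scan balances two effects: a very short timeout cancels jobs that were about to start, while a very long one wastes the full timeout on every lost job. With an exponential (memoryless) latency and no outliers, the same function returns the mean latency for any timeout, which is a convenient sanity check.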
Users and resources frequently join and leave a computational grid, hence the state of the grid change...
Real-time resource scheduling is an important factor for improving the performance of cluster comput...
Scientific communities are using a growing number of distributed systems, from local batch systems, ...
In this paper, we study grid job latency. Together with outliers, latency hig...
Abstract. In this paper, we study grid job submission latencies. The latency highly impacts performa...
In this paper, we study grid job submission latencies. The latency highly impa...
In this paper, we examine how the execution context of grid jobs can help to refine submission strat...
Independent observations and everyday user experience indicate that performance and reliability of l...
Grid reliability remains an order of magnitude below clusters on production i...
It is commonly observed that production grids are inherently unreliable. The a...
Large-scale distributed computing systems such as grids are serving a growing number of scientists. ...
Computational Grids are evolving into a global, service-oriented architecture – a universal platform...