Deep neural networks training jobs and other iterative computations frequently include checkpoints where jobs can be canceled based on the current value of monitored metrics. While most of existing results focus on the performance of all jobs (both successfully completed and canceled), in this work we explore scheduling policies that improve the sojourn time of successful jobs, which are typically more valuable to the user. Our model assumes that each job has a known discrete size distribution (e.g., estimated from previous execution logs) where the largest size value indicates a successful completion, while other size values correspond to termination checkpoints. In the single-server case where all jobs are available for scheduling simulta...
A single server is faced with a collection of jobs of varying duration and urgency. Before service s...
The operational cost of a cloud computing platform is one of the most significant Quality of Service...
A single server processes jobs that can yield rewards but expire on predetermined dates. Expected im...
International audienceIn this paper, we are interested in scheduling and checkpointing stochastic jo...
A single server is faced with a collection of jobs of varying duration and urgency. Each job has a r...
This paper focuses on the competitive analysis of scheduling disciplines in a large deviations setti...
The study of size-based and size-oblivious scheduling policies with inaccurate job size information ...
International audienceWe provide a queueing-theoretic framework for job replication schemes based on...
Early exit neural networks (EENs) reduce the processing times of deep convolutional neural networks ...
In non-clairvoyant scheduling, the task is to find an online strategy for scheduling jobs with a pri...
Recently, the so-called class of SMART scheduling policies has been introduced to formalize the comm...
Scheduled batch jobs have been widely used on the asynchronous computing platforms to execute variou...
Motivated by issues raised from data broadcast and networks using ATM and TCP/IP, we consider an onl...
International audienceThis work provides an analysis of checkpointing strategies for minimizing expe...
Scheduling is the mathematical problem of allocating tasks to resources considering certain constrai...
A single server is faced with a collection of jobs of varying duration and urgency. Before service s...
The operational cost of a cloud computing platform is one of the most significant Quality of Service...
A single server processes jobs that can yield rewards but expire on predetermined dates. Expected im...
International audienceIn this paper, we are interested in scheduling and checkpointing stochastic jo...
A single server is faced with a collection of jobs of varying duration and urgency. Each job has a r...
This paper focuses on the competitive analysis of scheduling disciplines in a large deviations setti...
The study of size-based and size-oblivious scheduling policies with inaccurate job size information ...
International audienceWe provide a queueing-theoretic framework for job replication schemes based on...
Early exit neural networks (EENs) reduce the processing times of deep convolutional neural networks ...
In non-clairvoyant scheduling, the task is to find an online strategy for scheduling jobs with a pri...
Recently, the so-called class of SMART scheduling policies has been introduced to formalize the comm...
Scheduled batch jobs have been widely used on the asynchronous computing platforms to execute variou...
Motivated by issues raised from data broadcast and networks using ATM and TCP/IP, we consider an onl...
International audienceThis work provides an analysis of checkpointing strategies for minimizing expe...
Scheduling is the mathematical problem of allocating tasks to resources considering certain constrai...
A single server is faced with a collection of jobs of varying duration and urgency. Before service s...
The operational cost of a cloud computing platform is one of the most significant Quality of Service...
A single server processes jobs that can yield rewards but expire on predetermined dates. Expected im...