We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives. There exist two different views: (i) the expectation semantics, where the goal is to optimize the expected mean-payoff objective, and (ii) the satisfaction semantics, where the goal is to maximize the probability of runs such that the mean-payoff value stays above a given vector. We consider optimization with respect to both objectives at once, thus unifying the existing semantics. Precisely, the goal is to optimize the expectation while ensuring the satisfaction constraint. Our problem captures the notion of optimization with respect to strategies that are risk-averse (i.e., ensure certain probabilistic guarantee). Our main results are as ...
We study countably infinite Markov decision processes (MDPs) with real-valued transition rewards. Ev...
International audienceMarkov decision processes (MDPs) are controllable discrete event systems with ...
We study countably infinite Markov decision processes (MDPs) with real-valuedtransition rewards. Eve...
We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives...
We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectiv...
We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives...
We study Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) functions. We...
We study Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) functions. We...
We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives...
We study Markov decision processes (MDPs) with multiple limit-average (ormean-payoff) functions. We ...
We consider finite horizon Markov decision processes under performance measures that involve both th...
We study countably infinite Markov decision processes (MDPs) with real-valued transition rewards. Ev...
We formalize the problem of maximizing the mean-payoff value with high probability while satisfying ...
We study the expected value of the window mean-payoff measure in Markov decision processes (MDPs) an...
We formalize the problem of maximizing the mean-payo value with high probability while satisfying a ...
We study countably infinite Markov decision processes (MDPs) with real-valued transition rewards. Ev...
International audienceMarkov decision processes (MDPs) are controllable discrete event systems with ...
We study countably infinite Markov decision processes (MDPs) with real-valuedtransition rewards. Eve...
We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives...
We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectiv...
We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives...
We study Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) functions. We...
We study Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) functions. We...
We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives...
We study Markov decision processes (MDPs) with multiple limit-average (ormean-payoff) functions. We ...
We consider finite horizon Markov decision processes under performance measures that involve both th...
We study countably infinite Markov decision processes (MDPs) with real-valued transition rewards. Ev...
We formalize the problem of maximizing the mean-payoff value with high probability while satisfying ...
We study the expected value of the window mean-payoff measure in Markov decision processes (MDPs) an...
We formalize the problem of maximizing the mean-payo value with high probability while satisfying a ...
We study countably infinite Markov decision processes (MDPs) with real-valued transition rewards. Ev...
International audienceMarkov decision processes (MDPs) are controllable discrete event systems with ...
We study countably infinite Markov decision processes (MDPs) with real-valuedtransition rewards. Eve...