A common approach to address multiobjective problems using reinforcement learning methods is to extend model-free, value-based algorithms such as Q-learning to use a vector of Q-values in combination with an appropriate action selection mechanism that is often based on scalarisation. Most prior empirical evaluation of these approaches has focused on deterministic environments. This study examines the impact on stochasticity in rewards and state transitions on the behaviour of multi-objective Q-learning. It shows that the nature of the optimal solution depends on these environmental characteristics, and also on whether we desire to maximise the Expected Scalarised Return (ESR) or the Scalarised Expected Return (SER). We also identify a novel...
We address the problem of reinforcement learning in which observations may exhibit an arbitrary form...
Despite growing interest over recent years in applying reinforcement learning to multiobjective prob...
In recent decades, Reinforcement Learning (RL) has emerged as an effective approach to address compl...
Multiobjective reinforcement learning algorithms extend reinforcement learning techniques to problem...
In this paper, we give a brief review of Markov Decision Processes (MDPs), and how Reinforcement Lea...
Many real-world problems involve the optimization of multiple, possibly conflicting ob-jectives. Mul...
In many real-world scenarios, the utility of a user is derived from the single execution of a policy...
For reinforcement learning tasks with multiple objectives, it may be advantageous to learn stochasti...
In this article we study the connection of stochastic optimal control and reinforcement learning. Ou...
This work describes MPQ-learning, an temporal-difference method that approximates the set of all non...
Many real-life problems involve dealing with multiple objectives. For example, in network routing th...
Markov decision processes are sequential decision-making processes in which the learning agent sense...
Reinforcement Learning (RL) in either fully or partially observable domains usually poses a requirem...
© 2020 IEEE. A common approach for defining a reward function for multi-objective reinforcement lear...
We study fair multi-objective reinforcement learning in which an agent must learn a policy that simu...
We address the problem of reinforcement learning in which observations may exhibit an arbitrary form...
Despite growing interest over recent years in applying reinforcement learning to multiobjective prob...
In recent decades, Reinforcement Learning (RL) has emerged as an effective approach to address compl...
Multiobjective reinforcement learning algorithms extend reinforcement learning techniques to problem...
In this paper, we give a brief review of Markov Decision Processes (MDPs), and how Reinforcement Lea...
Many real-world problems involve the optimization of multiple, possibly conflicting ob-jectives. Mul...
In many real-world scenarios, the utility of a user is derived from the single execution of a policy...
For reinforcement learning tasks with multiple objectives, it may be advantageous to learn stochasti...
In this article we study the connection of stochastic optimal control and reinforcement learning. Ou...
This work describes MPQ-learning, an temporal-difference method that approximates the set of all non...
Many real-life problems involve dealing with multiple objectives. For example, in network routing th...
Markov decision processes are sequential decision-making processes in which the learning agent sense...
Reinforcement Learning (RL) in either fully or partially observable domains usually poses a requirem...
© 2020 IEEE. A common approach for defining a reward function for multi-objective reinforcement lear...
We study fair multi-objective reinforcement learning in which an agent must learn a policy that simu...
We address the problem of reinforcement learning in which observations may exhibit an arbitrary form...
Despite growing interest over recent years in applying reinforcement learning to multiobjective prob...
In recent decades, Reinforcement Learning (RL) has emerged as an effective approach to address compl...