Concerned with multi-objective reinforcement learning (MORL), this paper presents MO-MCTS, an extension of Monte-Carlo Tree Search to multi-objective sequential decision making. The known multi-objective indicator referred to as hyper-volume indicator is used to define an action selection criterion, replacing the UCB criterion in order to deal with multi-dimensional rewards. MO-MCTS is firstly compared with an existing MORL algorithm on the artificial Deep Sea Treasure problem. Then a scalability study of MO-MCTS is made on the NP-hard problem of grid scheduling, showing that the performance of MO-MCTS matches the non RL-based state of the art albeit with a higher computational cost
This work investigates Monte-Carlo planning for agents in stochastic environments, with multiple obj...
Indicator-based evolutionary algorithms are amongst the best performing methods for solving multi-ob...
Abstract: Monte Carlo Tree Search (MCTS) is a method for making optimal decisions in artificial inte...
International audienceConcerned with multi-objective reinforcement learning (MORL), this paper prese...
International audienceConcerned with multi-objective reinforcement learning (MORL), this paper prese...
International audienceConcerned with multi-objective reinforcement learning (MORL), this paper prese...
International audienceConcerned with multi-objective reinforcement learning (MORL), this paper prese...
International audienceConcerned with multi-objective reinforcement learning (MORL), this paper prese...
This thesis is concerned with multi-Objective sequential decision making (MOSDM). The motivation is ...
This thesis is concerned with multi-Objective sequential decision making (MOSDM). The motivation is ...
This thesis is concerned with multi-Objective sequential decision making (MOSDM). The motivation is ...
This thesis is concerned with multi-Objective sequential decision making (MOSDM). The motivation is ...
This thesis is concerned with multi-Objective sequential decision making (MOSDM). The motivation is ...
Multi-Objective optimization has traditionally been applied to manufacturing, engineering or finance...
Multiobjective optimization has been traditionally a matter of study in domains like engineering or ...
This work investigates Monte-Carlo planning for agents in stochastic environments, with multiple obj...
Indicator-based evolutionary algorithms are amongst the best performing methods for solving multi-ob...
Abstract: Monte Carlo Tree Search (MCTS) is a method for making optimal decisions in artificial inte...
International audienceConcerned with multi-objective reinforcement learning (MORL), this paper prese...
International audienceConcerned with multi-objective reinforcement learning (MORL), this paper prese...
International audienceConcerned with multi-objective reinforcement learning (MORL), this paper prese...
International audienceConcerned with multi-objective reinforcement learning (MORL), this paper prese...
International audienceConcerned with multi-objective reinforcement learning (MORL), this paper prese...
This thesis is concerned with multi-Objective sequential decision making (MOSDM). The motivation is ...
This thesis is concerned with multi-Objective sequential decision making (MOSDM). The motivation is ...
This thesis is concerned with multi-Objective sequential decision making (MOSDM). The motivation is ...
This thesis is concerned with multi-Objective sequential decision making (MOSDM). The motivation is ...
This thesis is concerned with multi-Objective sequential decision making (MOSDM). The motivation is ...
Multi-Objective optimization has traditionally been applied to manufacturing, engineering or finance...
Multiobjective optimization has been traditionally a matter of study in domains like engineering or ...
This work investigates Monte-Carlo planning for agents in stochastic environments, with multiple obj...
Indicator-based evolutionary algorithms are amongst the best performing methods for solving multi-ob...
Abstract: Monte Carlo Tree Search (MCTS) is a method for making optimal decisions in artificial inte...