This paper proposes DeepSynth, a method for effective training of deep Reinforcement Learning (RL) agents when the reward is sparse and non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives. Our method employs a novel algorithm for synthesis of compact automata to uncover this sequential structure automatically. We synthesise a human-interpretable automaton from trace data collected by exploring the environment. The state space of the environment is then enriched with the synthesised automaton so that the generation of a control policy by deep RL is guided by the discovered structure encoded in the automaton. The proposed approach is able to cope with both high-dimen...
International audienceDeep learning has provided new ways of manipulating, processing and analyzing ...
Since DeepMind pioneered a deep reinforcement learning (DRL) model to play the Atari games, DRL has ...
Reinforcement learning has shown great potential in generalizing over raw sensory data using only a ...
This paper proposes DeepSynth, a method for effective training of deep Reinforcement Learning (RL) a...
The development of reinforcement learning attracts more and more attention among researchers. Levera...
Reinforcement learning (RL) is a general framework for learning and evaluating intelligent behaviors...
Reinforcement Learning (RL) represents a very promising field in the umbrella of Machine Learning (M...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
For robots to perform tasks in the unstructured environments of the real world, they must be able to...
We present a data-efficient framework for solving sequential decision-making problems which exploits...
In control, the objective is to find a mapping from states to actions that steer a system to a desir...
While recent advances in deep reinforcement learning have allowed autonomous learning agents to succ...
Deep reinforcement learning utilizes deep neural networks as the function approximator to model the ...
Reinforcement Learning (RL) is a framework to deal with decision-making problems with the goal of fi...
Reinforcement learning (RL) is a broad family of algorithms for training autonomous agents to collec...
International audienceDeep learning has provided new ways of manipulating, processing and analyzing ...
Since DeepMind pioneered a deep reinforcement learning (DRL) model to play the Atari games, DRL has ...
Reinforcement learning has shown great potential in generalizing over raw sensory data using only a ...
This paper proposes DeepSynth, a method for effective training of deep Reinforcement Learning (RL) a...
The development of reinforcement learning attracts more and more attention among researchers. Levera...
Reinforcement learning (RL) is a general framework for learning and evaluating intelligent behaviors...
Reinforcement Learning (RL) represents a very promising field in the umbrella of Machine Learning (M...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
For robots to perform tasks in the unstructured environments of the real world, they must be able to...
We present a data-efficient framework for solving sequential decision-making problems which exploits...
In control, the objective is to find a mapping from states to actions that steer a system to a desir...
While recent advances in deep reinforcement learning have allowed autonomous learning agents to succ...
Deep reinforcement learning utilizes deep neural networks as the function approximator to model the ...
Reinforcement Learning (RL) is a framework to deal with decision-making problems with the goal of fi...
Reinforcement learning (RL) is a broad family of algorithms for training autonomous agents to collec...
International audienceDeep learning has provided new ways of manipulating, processing and analyzing ...
Since DeepMind pioneered a deep reinforcement learning (DRL) model to play the Atari games, DRL has ...
Reinforcement learning has shown great potential in generalizing over raw sensory data using only a ...