We introduce Dynamic Planning Networks (DPN), a novel architecture for deep reinforcement learning, that combines model-based and model-free aspects for online planning. Our architecture learns to dynamically construct plans using a learned state-transition model by selecting and traversing between simulated states and actions to maximize information before acting. DPN learns to efficiently form plans by expanding a single action conditional state transition at a time instead of exhaustively evaluating each action, reducing the number of state-transitions used during planning. We observe emergent planning patterns in our agent, including classical search methods such as breadth-first and depth-first search. DPN shows improved performance ov...
© 2016 IEEE. This letter introduces a hybrid algorithm of deep reinforcement learning (RL) and Force...
There are many different methods for the deliberative control of autonomous systems in stochastic en...
We study using reinforcement learning in particular dynamic environ-ments. Our environments can cont...
This thesis presents the design, implementation and investigation of some predictive-planning contro...
This paper investigates a new approach to model-based reinforcement learning using background planni...
We present a new approach to learning for planning, where knowledge acquired while solving a given s...
We train embodied neural networks to plan and navigate unseen complex 3D environments, emphasising r...
Recent advances in applying deep learning to planning have shown that Deep Reactive Policies (DRPs) ...
For many real-world automated planning problems, it is difficult to obtain a transition model that g...
Abstract. Reinforcement learning (RL) involves sequential decision making in uncertain environments....
My research activity focuses on the integration of acting, learning and planning. The main objective...
Reinforcement Learning (RL) has seen exponential performance improvements over the past decade, achi...
State-of-the-art probabilistic planners typically apply look- ahead search and reasoning at each ste...
Deep reinforcement learning methods are capable of learning complex heuristics starting with no prio...
Deep Reinforcement Learning (DRL), is becoming a popular and mature framework for learning to solve ...
© 2016 IEEE. This letter introduces a hybrid algorithm of deep reinforcement learning (RL) and Force...
There are many different methods for the deliberative control of autonomous systems in stochastic en...
We study using reinforcement learning in particular dynamic environ-ments. Our environments can cont...
This thesis presents the design, implementation and investigation of some predictive-planning contro...
This paper investigates a new approach to model-based reinforcement learning using background planni...
We present a new approach to learning for planning, where knowledge acquired while solving a given s...
We train embodied neural networks to plan and navigate unseen complex 3D environments, emphasising r...
Recent advances in applying deep learning to planning have shown that Deep Reactive Policies (DRPs) ...
For many real-world automated planning problems, it is difficult to obtain a transition model that g...
Abstract. Reinforcement learning (RL) involves sequential decision making in uncertain environments....
My research activity focuses on the integration of acting, learning and planning. The main objective...
Reinforcement Learning (RL) has seen exponential performance improvements over the past decade, achi...
State-of-the-art probabilistic planners typically apply look- ahead search and reasoning at each ste...
Deep reinforcement learning methods are capable of learning complex heuristics starting with no prio...
Deep Reinforcement Learning (DRL), is becoming a popular and mature framework for learning to solve ...
© 2016 IEEE. This letter introduces a hybrid algorithm of deep reinforcement learning (RL) and Force...
There are many different methods for the deliberative control of autonomous systems in stochastic en...
We study using reinforcement learning in particular dynamic environ-ments. Our environments can cont...