Planning and reinforcement learning are two key approaches to sequential decision making. Multi-step approximate real-time dynamic programming, a recently successful algorithm class of which AlphaZero [Silver et al., 2018] is an example, combines both by nesting planning within a learning loop. However, the combination of planning and learning introduces a new question: how should we balance time spend on planning, learning and acting? The importance of this trade-off has not been explicitly studied before. We show that it is actually of key importance, with computational results indicating that we should neither plan too long nor too short. Conceptually, we identify a new spectrum of planning-learning algorithms which ranges from exhaustiv...
Deep reinforcement learning agents such as AlphaZero have achieved superhuman strength in complex co...
Model-based reinforcement learning includes two steps, estimation of a plant and planning. Planning ...
Reinforcement learning (RL) is a state-of-the-art approach to solving sequential decision-making pro...
Abstract. Reinforcement learning (RL) involves sequential decision making in uncertain environments....
Abstract. Reinforcement learning (RL) involves sequential decision making in uncertain environments....
Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a k...
Search based planners such as A* and Dijkstra\u27s algorithm are proven methods for guiding today\u2...
Search based planners such as A* and Dijkstra\u27s algorithm are proven methods for guiding today\u2...
Search based planners such as A* and Dijkstra\u27s algorithm are proven methods for guiding today\u2...
The practical application of learning agents requires sample efficient and interpretable algorithms....
Two common approaches to sequential decision-making are AI planning (AIP) and reinforcement learning...
Reinforcement learning (RL) agents can reduce learning time dramatically by planning with learned pr...
This article presents a detailed survey on Artificial Intelligent approaches, that combine Reinforce...
Learningshows great promise to extend the generality and effectiveness of planning techniques. Resea...
We consider how to learn Hierarchical Task Networks (HTNs) for planning problems in which both the q...
Deep reinforcement learning agents such as AlphaZero have achieved superhuman strength in complex co...
Model-based reinforcement learning includes two steps, estimation of a plant and planning. Planning ...
Reinforcement learning (RL) is a state-of-the-art approach to solving sequential decision-making pro...
Abstract. Reinforcement learning (RL) involves sequential decision making in uncertain environments....
Abstract. Reinforcement learning (RL) involves sequential decision making in uncertain environments....
Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a k...
Search based planners such as A* and Dijkstra\u27s algorithm are proven methods for guiding today\u2...
Search based planners such as A* and Dijkstra\u27s algorithm are proven methods for guiding today\u2...
Search based planners such as A* and Dijkstra\u27s algorithm are proven methods for guiding today\u2...
The practical application of learning agents requires sample efficient and interpretable algorithms....
Two common approaches to sequential decision-making are AI planning (AIP) and reinforcement learning...
Reinforcement learning (RL) agents can reduce learning time dramatically by planning with learned pr...
This article presents a detailed survey on Artificial Intelligent approaches, that combine Reinforce...
Learningshows great promise to extend the generality and effectiveness of planning techniques. Resea...
We consider how to learn Hierarchical Task Networks (HTNs) for planning problems in which both the q...
Deep reinforcement learning agents such as AlphaZero have achieved superhuman strength in complex co...
Model-based reinforcement learning includes two steps, estimation of a plant and planning. Planning ...
Reinforcement learning (RL) is a state-of-the-art approach to solving sequential decision-making pro...