Abstract. Reinforcement learning is bedeviled by the curse of dimensionality: the number of parameters to be learned grows exponentially with the size of any compact encoding of a state. Recent attempts to combat the curse of dimensionality have turned to principled ways of exploiting temporal abstraction, where decisions are not required at each step, but rather invoke the execution of temporally-extended activities which follow their own policies until termination. This leads naturally to hierarchical control architectures and associated learning algorithms. We review several approaches to temporal abstraction and hierarchical organization that machine learning researchers have recently developed. Common to these approaches is a reliance ...
Hierarchical Reinforcement Learning (HRL) algorithms can perform planning at multiple levels of abst...
To solve partially observable Markov decision problems, we introduce HQ-learning, a hierarchical ext...
A hierarchical representation of the input-output transition function in a learning system is sugges...
Reinforcement learning (RL) is an area of Machine Learning (ML) concerned with learning how a softwa...
This paper describes two methods for hierarchically organizing temporal behaviors. The first is more...
This thesis addresses the open problem of automatically discovering hierarchical structure in reinfo...
Common approaches to Reinforcement Learning (RL) are seriously challenged by large-scale application...
This paper presents a new method for the autonomous construction of hierarchical action and state re...
Humans use prior knowledge to efficiently solve novel tasks, but how they structure past knowledge d...
This dissertation investigates the use of hierarchy and abstraction as a means of solving complex se...
Reinforcement learning provides a means for autonomous agents to improve their action selection stra...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Hierarchical Reinforcement Learning (HRL) algorithms can perform planning at multiple levels of abst...
Over the past decade, reinforcement learning (RL; e.g., see [9]) has been an active area of AI resea...
Hierarchical Reinforcement Learning (HRL) algorithms can perform planning at multiple levels of abst...
To solve partially observable Markov decision problems, we introduce HQ-learning, a hierarchical ext...
A hierarchical representation of the input-output transition function in a learning system is sugges...
Reinforcement learning (RL) is an area of Machine Learning (ML) concerned with learning how a softwa...
This paper describes two methods for hierarchically organizing temporal behaviors. The first is more...
This thesis addresses the open problem of automatically discovering hierarchical structure in reinfo...
Common approaches to Reinforcement Learning (RL) are seriously challenged by large-scale application...
This paper presents a new method for the autonomous construction of hierarchical action and state re...
Humans use prior knowledge to efficiently solve novel tasks, but how they structure past knowledge d...
This dissertation investigates the use of hierarchy and abstraction as a means of solving complex se...
Reinforcement learning provides a means for autonomous agents to improve their action selection stra...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Hierarchical Reinforcement Learning (HRL) algorithms can perform planning at multiple levels of abst...
Over the past decade, reinforcement learning (RL; e.g., see [9]) has been an active area of AI resea...
Hierarchical Reinforcement Learning (HRL) algorithms can perform planning at multiple levels of abst...
To solve partially observable Markov decision problems, we introduce HQ-learning, a hierarchical ext...
A hierarchical representation of the input-output transition function in a learning system is sugges...