A fundamental problem in reinforcement learning is balancing exploration and exploitation. We address this problem in the context of model-based reinforcement learning in large stochastic relational domains by developing relational extensions of the concepts of the E-3 and R-MAX algorithms. Efficient exploration in exponentially large state spaces needs to exploit the generalization of the learned model: what in a propositional setting would be considered a novel situation and worth exploration may in the relational setting be a well-known context in which exploitation is promising. To address this we introduce relational count functions which generalize the classical notion of state and action visitation counts. We provide guarantees on th...
Relational reinforcement learning combines traditional rein-forcement learning with a strong emphasi...
Model-based reinforcement learning is a powerful paradigm for learning tasks in robotics. However, i...
We introduce an approach to model-free relational reinforcement learning in finitehorizon, undiscoun...
Abstract. One of the key problems in model-based reinforcement learn-ing is balancing exploration an...
In recent years, there has been a growing interest in using rich representations such as relational...
Probabilistic planners have improved recently to the point that they can solve difficult tasks with ...
The exploration-exploitation tradeoff is crucial to reinforcement-learning (RL) agents, and a signi...
International audienceProbabilistic planners have improved recently to the point that they can solve...
The exploration-exploitation tradeoff is crucial to reinforcement-learning (RL) agents, and a signif...
AI researchers have long studied algorithms for plan-ning and learning-to-plan within highly structu...
In recent years, there has been a growing interest in using rich representations such as relational ...
Reinforcement learning has developed into a primary approach for learning control strategies for aut...
Reinforcement learning has developed into a primary approach for learning control strategies for aut...
Abstract. In reinforcement learning, an agent tries to learn a policy, i.e., how to select an action...
Relational reinforcement learning is presented, a learning technique that combines reinforcement lea...
Relational reinforcement learning combines traditional rein-forcement learning with a strong emphasi...
Model-based reinforcement learning is a powerful paradigm for learning tasks in robotics. However, i...
We introduce an approach to model-free relational reinforcement learning in finitehorizon, undiscoun...
Abstract. One of the key problems in model-based reinforcement learn-ing is balancing exploration an...
In recent years, there has been a growing interest in using rich representations such as relational...
Probabilistic planners have improved recently to the point that they can solve difficult tasks with ...
The exploration-exploitation tradeoff is crucial to reinforcement-learning (RL) agents, and a signi...
International audienceProbabilistic planners have improved recently to the point that they can solve...
The exploration-exploitation tradeoff is crucial to reinforcement-learning (RL) agents, and a signif...
AI researchers have long studied algorithms for plan-ning and learning-to-plan within highly structu...
In recent years, there has been a growing interest in using rich representations such as relational ...
Reinforcement learning has developed into a primary approach for learning control strategies for aut...
Reinforcement learning has developed into a primary approach for learning control strategies for aut...
Abstract. In reinforcement learning, an agent tries to learn a policy, i.e., how to select an action...
Relational reinforcement learning is presented, a learning technique that combines reinforcement lea...
Relational reinforcement learning combines traditional rein-forcement learning with a strong emphasi...
Model-based reinforcement learning is a powerful paradigm for learning tasks in robotics. However, i...
We introduce an approach to model-free relational reinforcement learning in finitehorizon, undiscoun...