How do you incentivize self-interested agents to $\textit{explore}$ when they prefer to $\textit{exploit}$? We consider complex exploration problems, where each agent faces the same (but unknown) MDP. In contrast with traditional formulations of reinforcement learning, agents control the choice of policies, whereas an algorithm can only issue recommendations. However, the algorithm controls the flow of information, and can incentivize the agents to explore via information asymmetry. We design an algorithm which explores all reachable states in the MDP. We achieve provable guarantees similar to those for incentivizing exploration in static, stateless exploration problems studied previously. To the best of our knowledge, this is the first wor...
AbstractThe basic tenet of a learning process is for an agent to learn for only as much and as long ...
This thesis presents novel work on how to improve exploration in reinforcement learning using domain...
A fundamental objective in reinforcement learning is the maintenance of a proper balance between exp...
How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal...
Learning for exploration/exploitation in reinforcement learning We address in this thesis the origin...
International audienceRealistic environments often provide agents with very limited feedback. When t...
How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal...
Balancing exploratory and exploitative behavior is an essential dilemma faced by adaptive agents. Th...
While in general trading off exploration and exploitation in reinforcement learning is hard, under s...
Over the course of the last decade, the framework of reinforcement learning has developed into a pro...
This thesis presents novel work on how to improve exploration in reinforcement learning using domain...
Reinforcement learning (RL) aims to learn optimal behaviors for agents to maximize cumulative reward...
Sparse reward is one of the biggest challenges in reinforcement learning (RL). In this paper, we pro...
Exploration plays a fundamental role in any active learning system. This study evaluates the role of...
This paper presents a model allowing to tune continual exploration in an optimal way by integrating ...
AbstractThe basic tenet of a learning process is for an agent to learn for only as much and as long ...
This thesis presents novel work on how to improve exploration in reinforcement learning using domain...
A fundamental objective in reinforcement learning is the maintenance of a proper balance between exp...
How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal...
Learning for exploration/exploitation in reinforcement learning We address in this thesis the origin...
International audienceRealistic environments often provide agents with very limited feedback. When t...
How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal...
Balancing exploratory and exploitative behavior is an essential dilemma faced by adaptive agents. Th...
While in general trading off exploration and exploitation in reinforcement learning is hard, under s...
Over the course of the last decade, the framework of reinforcement learning has developed into a pro...
This thesis presents novel work on how to improve exploration in reinforcement learning using domain...
Reinforcement learning (RL) aims to learn optimal behaviors for agents to maximize cumulative reward...
Sparse reward is one of the biggest challenges in reinforcement learning (RL). In this paper, we pro...
Exploration plays a fundamental role in any active learning system. This study evaluates the role of...
This paper presents a model allowing to tune continual exploration in an optimal way by integrating ...
AbstractThe basic tenet of a learning process is for an agent to learn for only as much and as long ...
This thesis presents novel work on how to improve exploration in reinforcement learning using domain...
A fundamental objective in reinforcement learning is the maintenance of a proper balance between exp...