In many real-world applications of reinforcement learning (RL), performing actions requires consuming certain types of resources that are non-replenishable in each episode. Typical applications include robotic control with limited energy and video games with consumable items. In tasks with non-replenishable resources, we observe that popular RL methods such as soft actor critic suffer from poor sample efficiency. The major reason is that, they tend to exhaust resources fast and thus the subsequent exploration is severely restricted due to the absence of resources. To address this challenge, we first formalize the aforementioned problem as a resource-restricted reinforcement learning, and then propose a novel resource-aware exploration bonus...
A fundamental challenge for reinforcement learning (RL) is how to achieve effcient exploration in in...
The general assumption in reinforcement learning(RL) that agents are free to explore for searching o...
In this thesis, we discuss various techniques for improving exploration for deep reinforcement learn...
Reinforcement learning (RL) aims to learn optimal behaviors for agents to maximize cumulative reward...
International audienceRealistic environments often provide agents with very limited feedback. When t...
Recent Reinforcement Learning (RL) algorithms, such as R-MAX, make (with high probability) only a sm...
Exploration plays a fundamental role in any active learning system. This study evaluates the role of...
Reinforcement learning is a powerful approach for learning control policies that solve sequential de...
International audienceOne of the challenges in online reinforcement learning (RL) is that the agent ...
Reinforcement learning (RL) agents need to explore their environments in order to learn optimal poli...
Equipping artificial agents with useful exploration mechanisms remains a challenge to this day. Huma...
Deep Reinforcement Learning (DRL) and Deep Multi-agent Reinforcement Learning (MARL) have achieved s...
Exploration is essential for reinforcement learning (RL). To face the challenges of exploration, we ...
Sparse reward is one of the biggest challenges in reinforcement learning (RL). In this paper, we pro...
Over the course of the last decade, the framework of reinforcement learning has developed into a pro...
A fundamental challenge for reinforcement learning (RL) is how to achieve effcient exploration in in...
The general assumption in reinforcement learning(RL) that agents are free to explore for searching o...
In this thesis, we discuss various techniques for improving exploration for deep reinforcement learn...
Reinforcement learning (RL) aims to learn optimal behaviors for agents to maximize cumulative reward...
International audienceRealistic environments often provide agents with very limited feedback. When t...
Recent Reinforcement Learning (RL) algorithms, such as R-MAX, make (with high probability) only a sm...
Exploration plays a fundamental role in any active learning system. This study evaluates the role of...
Reinforcement learning is a powerful approach for learning control policies that solve sequential de...
International audienceOne of the challenges in online reinforcement learning (RL) is that the agent ...
Reinforcement learning (RL) agents need to explore their environments in order to learn optimal poli...
Equipping artificial agents with useful exploration mechanisms remains a challenge to this day. Huma...
Deep Reinforcement Learning (DRL) and Deep Multi-agent Reinforcement Learning (MARL) have achieved s...
Exploration is essential for reinforcement learning (RL). To face the challenges of exploration, we ...
Sparse reward is one of the biggest challenges in reinforcement learning (RL). In this paper, we pro...
Over the course of the last decade, the framework of reinforcement learning has developed into a pro...
A fundamental challenge for reinforcement learning (RL) is how to achieve effcient exploration in in...
The general assumption in reinforcement learning(RL) that agents are free to explore for searching o...
In this thesis, we discuss various techniques for improving exploration for deep reinforcement learn...