While parallelism has been extensively used in Reinforcement Learning (RL), the quantitative effects of parallel exploration are not well understood theoretically. We study the benefits of simple parallel exploration for reward-free RL for linear Markov decision processes (MDPs) and two-player zero-sum Markov games (MGs). In contrast to the existing literature focused on approaches that encourage agents to explore over a diverse set of policies, we show that using a single policy to guide exploration across all agents is sufficient to obtain an almost-linear speedup in all cases compared to their fully sequential counterpart. Further, we show that this simple procedure is minimax optimal up to logarithmic factors in the reward-free setting ...
Reinforcement learning (RL) has seen widespread success in creating intelligent agents in several ch...
The main contributions in this thesis include the selectively decentralized method in solving multi-...
An important issue in reinforcement learning systems for autonomous agents is whether it makes sense...
We propose a new strategy for parallel reinforcement learning ; using this strategy, the optimal val...
Reward-free reinforcement learning (RL) considers the setting where the agent does not have access t...
We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on ...
Reinforcement learning (RL) is a machine learning paradigm where an agent learns to interact with an...
Learning for exploration/exploitation in reinforcement learning We address in this thesis the origin...
Reinforcement learning is an important family of algo-rithms that have been extremely effective in f...
Machine learning and artificial intelligence has been a hot topic the last few years, thanks to impr...
Reinforcement learning (RL) aims to learn optimal behaviors for agents to maximize cumulative reward...
High sample complexity remains a barrier to the application of reinforcement learning (RL), particul...
This paper investigates the use of experience generaliza-tion on concurrent and on-line policy learn...
International audienceRealistic environments often provide agents with very limited feedback. When t...
Reinforcement learning (RL) focuses on an essential aspect of intelligent behavior – how an agent ca...
Reinforcement learning (RL) has seen widespread success in creating intelligent agents in several ch...
The main contributions in this thesis include the selectively decentralized method in solving multi-...
An important issue in reinforcement learning systems for autonomous agents is whether it makes sense...
We propose a new strategy for parallel reinforcement learning ; using this strategy, the optimal val...
Reward-free reinforcement learning (RL) considers the setting where the agent does not have access t...
We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on ...
Reinforcement learning (RL) is a machine learning paradigm where an agent learns to interact with an...
Learning for exploration/exploitation in reinforcement learning We address in this thesis the origin...
Reinforcement learning is an important family of algo-rithms that have been extremely effective in f...
Machine learning and artificial intelligence has been a hot topic the last few years, thanks to impr...
Reinforcement learning (RL) aims to learn optimal behaviors for agents to maximize cumulative reward...
High sample complexity remains a barrier to the application of reinforcement learning (RL), particul...
This paper investigates the use of experience generaliza-tion on concurrent and on-line policy learn...
International audienceRealistic environments often provide agents with very limited feedback. When t...
Reinforcement learning (RL) focuses on an essential aspect of intelligent behavior – how an agent ca...
Reinforcement learning (RL) has seen widespread success in creating intelligent agents in several ch...
The main contributions in this thesis include the selectively decentralized method in solving multi-...
An important issue in reinforcement learning systems for autonomous agents is whether it makes sense...