Reinforcement learning with function approximation has recently achieved tremendous results in applications with large state spaces. This empirical success has motivated a growing body of theoretical work proposing necessary and sufficient conditions under which efficient reinforcement learning is possible. From this line of work, a remarkably simple minimal sufficient condition has emerged for sample efficient reinforcement learning: MDPs with optimal value function $V^*$ and $Q^*$ linear in some known low-dimensional features. In this setting, recent works have designed sample efficient algorithms which require a number of samples polynomial in the feature dimension and independent of the size of state space. They however leave finding co...
Reinforcement learning (RL) is a computational framework for learning sequential decision strategies...
There are several reinforcement learning algorithms that yield ap-proximate solutions for the proble...
Reinforcement learning algorithms hold promise in many complex domains, such as resource management ...
With the increasing need for handling large state and action spaces, general function approximation ...
We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on ...
We study reinforcement learning (RL) with linear function approximation. For episodic time-inhomogen...
The curse of dimensionality is a widely known issue in reinforcement learning (RL). In the tabular s...
Reward-free reinforcement learning (RL) considers the setting where the agent does not have access t...
Reinforcement Learning (RL) in finite state and action Markov Decision Processes is studied with an ...
Inspired by recent results on polynomial time reinforcement algorithms that accumulate near-optimal ...
In probably approximately correct (PAC) reinforcement learning (RL), an agent is required to identif...
Linear reinforcement learning (RL) algorithms like least-squares temporal difference learning (LSTD)...
Sample-efficient offline reinforcement learning (RL) with linear function approximation has been stu...
The Zap stochastic approximation (SA) algorithm was introduced recently as a means to accelerate con...
Reinforcement learning (RL) has recently emerged as a generic yet powerful solution for learning com...
Reinforcement learning (RL) is a computational framework for learning sequential decision strategies...
There are several reinforcement learning algorithms that yield ap-proximate solutions for the proble...
Reinforcement learning algorithms hold promise in many complex domains, such as resource management ...
With the increasing need for handling large state and action spaces, general function approximation ...
We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on ...
We study reinforcement learning (RL) with linear function approximation. For episodic time-inhomogen...
The curse of dimensionality is a widely known issue in reinforcement learning (RL). In the tabular s...
Reward-free reinforcement learning (RL) considers the setting where the agent does not have access t...
Reinforcement Learning (RL) in finite state and action Markov Decision Processes is studied with an ...
Inspired by recent results on polynomial time reinforcement algorithms that accumulate near-optimal ...
In probably approximately correct (PAC) reinforcement learning (RL), an agent is required to identif...
Linear reinforcement learning (RL) algorithms like least-squares temporal difference learning (LSTD)...
Sample-efficient offline reinforcement learning (RL) with linear function approximation has been stu...
The Zap stochastic approximation (SA) algorithm was introduced recently as a means to accelerate con...
Reinforcement learning (RL) has recently emerged as a generic yet powerful solution for learning com...
Reinforcement learning (RL) is a computational framework for learning sequential decision strategies...
There are several reinforcement learning algorithms that yield ap-proximate solutions for the proble...
Reinforcement learning algorithms hold promise in many complex domains, such as resource management ...