We study the sample complexity of teaching, termed as ``teaching dimension" (TDim) in the literature, for the teaching-by-reinforcement paradigm, where the teacher guides the student through rewards. This is distinct from the teaching-by-demonstration paradigm motivated by robotics applications, where the teacher teaches by providing demonstrations of state/action trajectories. The teaching-by-reinforcement paradigm applies to a wider range of real-world settings where a demonstration is inconvenient, but has not been studied systematically. In this paper, we focus on a specific family of reinforcement learning algorithms, Q-learning, and characterize the TDim under different teachers with varying control power over the environment, and pr...
Hierarchical Reinforcement Learning (HRL) algorithms can perform planning at multiple levels of abst...
This paper describes several new online model-free reinforcement learning (RL) algorithms. We design...
Reinforcement learning is a machine learning method, which is an unsupervised one which situations a...
AbstractWhile most theoretical work in machine learning has focused on the complexity of learning, r...
While most theoretical work in machine learning has focused on the complexity of learning, recently ...
AbstractWhile most theoretical work in machine learning has focused on the complexity of learning, r...
While most theoretical work in machine learning has focused on the complexity of learning, recently ...
The Reinforcement learning (RL) algorithms solve a wide range of problems we faced. The topic of RL ...
We study the problem of inverse reinforcement learning (IRL) with the added twist that the learner i...
We study the problem of inverse reinforcement learning (IRL) with the added twist that the learner i...
Inverse reinforcement learning (IRL) infers a reward function from demonstrations, allowing for poli...
Typically, a reinforcement learning agent interacts with the environment and learns how to select an...
AbstractIn a typical algorithmic learning model, a learner has to identify a target object from part...
AbstractIn a typical algorithmic learning model, a learner has to identify a target object from part...
This paper describes several new online model-free reinforcement learning (RL) algorithms. We design...
Hierarchical Reinforcement Learning (HRL) algorithms can perform planning at multiple levels of abst...
This paper describes several new online model-free reinforcement learning (RL) algorithms. We design...
Reinforcement learning is a machine learning method, which is an unsupervised one which situations a...
AbstractWhile most theoretical work in machine learning has focused on the complexity of learning, r...
While most theoretical work in machine learning has focused on the complexity of learning, recently ...
AbstractWhile most theoretical work in machine learning has focused on the complexity of learning, r...
While most theoretical work in machine learning has focused on the complexity of learning, recently ...
The Reinforcement learning (RL) algorithms solve a wide range of problems we faced. The topic of RL ...
We study the problem of inverse reinforcement learning (IRL) with the added twist that the learner i...
We study the problem of inverse reinforcement learning (IRL) with the added twist that the learner i...
Inverse reinforcement learning (IRL) infers a reward function from demonstrations, allowing for poli...
Typically, a reinforcement learning agent interacts with the environment and learns how to select an...
AbstractIn a typical algorithmic learning model, a learner has to identify a target object from part...
AbstractIn a typical algorithmic learning model, a learner has to identify a target object from part...
This paper describes several new online model-free reinforcement learning (RL) algorithms. We design...
Hierarchical Reinforcement Learning (HRL) algorithms can perform planning at multiple levels of abst...
This paper describes several new online model-free reinforcement learning (RL) algorithms. We design...
Reinforcement learning is a machine learning method, which is an unsupervised one which situations a...