We present a framework to address a class of sequential decision making problems. Our framework features learning the optimal control policy with robustness to noisy data, determining the unknown state and action parameters, and performing sensitivity analysis with respect to problem parameters. We consider two broad categories of sequential decision making problems modeled as infinite horizon Markov Decision Processes (MDPs) with (and without) an absorbing state. The central idea underlying our framework is to quantify exploration in terms of the Shannon Entropy of the trajectories under the MDP and determine the stochastic policy that maximizes it while guaranteeing a low value of the expected cost along a trajectory. This resulting polic...
We introduce a class of MPDs which greatly simplify Reinforcement Learning. They have discrete state...
We present a class of metrics, defined on the state space of a finite Markov decision process (MDP)...
In this paper, we study the problem of transferring the available Markov Decision Process (MDP) mode...
In this paper, we propose a max-min entropy framework for reinforcement learning (RL) to overcome th...
Recent research has shown the benefit of framing problems of imitation learning as solutions to Mark...
We introduce a class of Markov decision problems (MDPs) which greatly simplify Reinforcement Learnin...
Recent research has shown the benefit of framing problems of imitation learning as solutions to Mark...
ICML-2023We address the challenge of exploration in reinforcement learning (RL) when the agent opera...
Reinforcement learning (RL) is an important field of research in machine learning that is increasing...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
In this paper, we present a new class of Markov decision processes (MDPs), called Tsallis MDPs, with...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
We introduce a model-free algorithm for learning in Markov decision processes with parameterized act...
We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse Reinforcem...
The goals of perturbation analysis (PA), Markov decision processes (MDPs), and reinforcement learnin...
We introduce a class of MPDs which greatly simplify Reinforcement Learning. They have discrete state...
We present a class of metrics, defined on the state space of a finite Markov decision process (MDP)...
In this paper, we study the problem of transferring the available Markov Decision Process (MDP) mode...
In this paper, we propose a max-min entropy framework for reinforcement learning (RL) to overcome th...
Recent research has shown the benefit of framing problems of imitation learning as solutions to Mark...
We introduce a class of Markov decision problems (MDPs) which greatly simplify Reinforcement Learnin...
Recent research has shown the benefit of framing problems of imitation learning as solutions to Mark...
ICML-2023We address the challenge of exploration in reinforcement learning (RL) when the agent opera...
Reinforcement learning (RL) is an important field of research in machine learning that is increasing...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
In this paper, we present a new class of Markov decision processes (MDPs), called Tsallis MDPs, with...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
We introduce a model-free algorithm for learning in Markov decision processes with parameterized act...
We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse Reinforcem...
The goals of perturbation analysis (PA), Markov decision processes (MDPs), and reinforcement learnin...
We introduce a class of MPDs which greatly simplify Reinforcement Learning. They have discrete state...
We present a class of metrics, defined on the state space of a finite Markov decision process (MDP)...
In this paper, we study the problem of transferring the available Markov Decision Process (MDP) mode...