In this thesis, we study the related problems of reinforcement learning and optimal adaptive control, specialized to specific classes of stochastic and structured dynamical systems. By stochastic, we mean systems that are unknown to the decision maker and evolve according to some probabilistic law. By structured, we mean that they are restricted in some known way, e.g., they belong to a specific model class or must obey a set of known constraints. The objective in both problems is the design of an optimal algorithm, i.e., one that maximizes a certain performance metric. Because of the stochasticity, the algorithm faces an exploration-exploitation dilemma, where it must balance collecting information from the system and leveraging existing i...
Optimal control and Reinforcement Learning deal both with sequential decision-making problems, altho...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...
In this paper we consider the problem of reinforcement learning in a dynamically changing environmen...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
We consider a class of sequential decision making problems in the presence of uncertainty, which bel...
We consider an agent interacting with an environment in a single stream of actions, observations, an...
We consider an agent interacting with an en-vironment in a single stream of actions, ob-servations, ...
We consider the problem of minimizing the long term average expected regret of an agent in an online...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
In this article we study the connection of stochastic optimal control and reinforcement learning. Ou...
We consider the restless Markov bandit problem, in which the state of each arm evolves according to ...
International audienceWe consider the restless Markov bandit problem, in which the state of each arm...
Reinforcement learning (RL) has gained an increasing interest in recent years, being expected to del...
International audienceWe consider reinforcement learning in a discrete, undiscounted, infinite-horiz...
2014-10-14This dissertation addresses some problems in the area of learning, optimization and decisi...
Optimal control and Reinforcement Learning deal both with sequential decision-making problems, altho...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...
In this paper we consider the problem of reinforcement learning in a dynamically changing environmen...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
We consider a class of sequential decision making problems in the presence of uncertainty, which bel...
We consider an agent interacting with an environment in a single stream of actions, observations, an...
We consider an agent interacting with an en-vironment in a single stream of actions, ob-servations, ...
We consider the problem of minimizing the long term average expected regret of an agent in an online...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
In this article we study the connection of stochastic optimal control and reinforcement learning. Ou...
We consider the restless Markov bandit problem, in which the state of each arm evolves according to ...
International audienceWe consider the restless Markov bandit problem, in which the state of each arm...
Reinforcement learning (RL) has gained an increasing interest in recent years, being expected to del...
International audienceWe consider reinforcement learning in a discrete, undiscounted, infinite-horiz...
2014-10-14This dissertation addresses some problems in the area of learning, optimization and decisi...
Optimal control and Reinforcement Learning deal both with sequential decision-making problems, altho...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...
In this paper we consider the problem of reinforcement learning in a dynamically changing environmen...