This dissertation explores learning important structural features of a Markov DecisionProcess from offline data to significantly improve the sample-efficiency, stability, and robustnessof solutions even with high dimensional action spaces and long time horizons. Itpresents applications to surgical robot control, data cleaning, and generating efficient executionplans for relational queries. The dissertation contributes: (1) Sequential WindowedReinforcement Learning: a framework that approximates a long-horizon MDP with a sequenceof shorter term MDPs with smooth quadratic cost functions from a small numberof expert demonstrations, (2) Deep Discovery of Options: an algorithm that discovers hierarchicalstructure in the action space from observe...
The first part of a two-part series of papers provides a survey on recent advances in Deep Reinforce...
International audienceMulti-robot task allocation (MRTA) problems require that robots make complex c...
International audienceMulti-robot task allocation (MRTA) problems require that robots take complex c...
Reinforcement learning (RL) is a general framework for learning and evaluating intelligent behaviors...
Reinforcement learning (RL) is an area of Machine Learning (ML) concerned with learning how a softwa...
International audienceDeep learning has provided new ways of manipulating, processing and analyzing ...
This electronic version was submitted by the student author. The certified thesis is available in th...
As robots become increasingly common in modern society, the need for effective machine learning of r...
We present a data-efficient framework for solving sequential decision-making problems which exploits...
The two-part series of papers provides a survey on recent advances in Deep Reinforcement Learning (D...
We live in the era of big data in which the advancement of sensor and monitoring technologies, data ...
Reinforcement learning (RL) is a machine learning paradigm where an agent learns to interact with an...
In this thesis, we study how maximum entropy framework can provide efficient deep reinforcement lear...
Robotic systems are ever more capable of automation and fulfilment of complex tasks, particularly wi...
In the past few years, deep reinforcement learning (RL) has shown great potential in learning action...
The first part of a two-part series of papers provides a survey on recent advances in Deep Reinforce...
International audienceMulti-robot task allocation (MRTA) problems require that robots make complex c...
International audienceMulti-robot task allocation (MRTA) problems require that robots take complex c...
Reinforcement learning (RL) is a general framework for learning and evaluating intelligent behaviors...
Reinforcement learning (RL) is an area of Machine Learning (ML) concerned with learning how a softwa...
International audienceDeep learning has provided new ways of manipulating, processing and analyzing ...
This electronic version was submitted by the student author. The certified thesis is available in th...
As robots become increasingly common in modern society, the need for effective machine learning of r...
We present a data-efficient framework for solving sequential decision-making problems which exploits...
The two-part series of papers provides a survey on recent advances in Deep Reinforcement Learning (D...
We live in the era of big data in which the advancement of sensor and monitoring technologies, data ...
Reinforcement learning (RL) is a machine learning paradigm where an agent learns to interact with an...
In this thesis, we study how maximum entropy framework can provide efficient deep reinforcement lear...
Robotic systems are ever more capable of automation and fulfilment of complex tasks, particularly wi...
In the past few years, deep reinforcement learning (RL) has shown great potential in learning action...
The first part of a two-part series of papers provides a survey on recent advances in Deep Reinforce...
International audienceMulti-robot task allocation (MRTA) problems require that robots make complex c...
International audienceMulti-robot task allocation (MRTA) problems require that robots take complex c...