Building autonomous agents that learn to make predictions and take actions in sequential environments is a central problem in artificial intelligence, with applications as diverse as personalized medicine, self-driving cars, finance, and scientific discovery. Despite impressive success in certain areas such as natural language, games, and robotic demonstrations, sequential prediction and decision-making remains challenging in the absence of known models, accurate environment simulators, short-range dependencies, and large and diverse datasets.In this thesis, we formulate problems to capture challenging yet prevalent settings encountered in the real-world. Given the formulations, we then design reliable and efficient learning algorithms, lev...
The use of artificial intelligence in systems for autonomous vehicles is growing in popularity [1, 2...
In many Reinforcement Learning (RL) tasks, the classical online interaction of the learning agent wi...
Sequential decision making under uncertainty problems often deal with partially observable Markov de...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
<p>Sequential prediction problems arise commonly in many areas of robotics and information processin...
Sequential prediction problems arise commonly in many areas of robotics and information processing: ...
Offline Reinforcement Learning (RL) aims at learning an optimal control from a fixed dataset, withou...
Because the physical world is complex, ambiguous, and unpredictable, autonomous agents must be engin...
This thesis focuses on Reinforcement Learning (RL) which considers an agent that makes sequen- tial ...
This thesis focuses on Reinforcement Learning (RL) which considers an agent that makes sequen- tial ...
Information gathering in a partially observable environment can be formulated as a reinforcement lea...
The problem of making decisions is ubiquitous in life. This problem becomes even more complex when t...
In recent years, artificial learning systems have demonstrated tremendous advances on a number of ch...
The use of artificial intelligence in systems for autonomous vehicles is growing in popularity [1, 2...
In many Reinforcement Learning (RL) tasks, the classical online interaction of the learning agent wi...
Sequential decision making under uncertainty problems often deal with partially observable Markov de...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
<p>Sequential prediction problems arise commonly in many areas of robotics and information processin...
Sequential prediction problems arise commonly in many areas of robotics and information processing: ...
Offline Reinforcement Learning (RL) aims at learning an optimal control from a fixed dataset, withou...
Because the physical world is complex, ambiguous, and unpredictable, autonomous agents must be engin...
This thesis focuses on Reinforcement Learning (RL) which considers an agent that makes sequen- tial ...
This thesis focuses on Reinforcement Learning (RL) which considers an agent that makes sequen- tial ...
Information gathering in a partially observable environment can be formulated as a reinforcement lea...
The problem of making decisions is ubiquitous in life. This problem becomes even more complex when t...
In recent years, artificial learning systems have demonstrated tremendous advances on a number of ch...
The use of artificial intelligence in systems for autonomous vehicles is growing in popularity [1, 2...
In many Reinforcement Learning (RL) tasks, the classical online interaction of the learning agent wi...
Sequential decision making under uncertainty problems often deal with partially observable Markov de...