For an intelligent agent to interact with the environment efficiently, it must have the ability to predict, plan and generalize. This thesis studies how an intelligent agent can learn to predict future observations and leverage the predictive models for efficient policy learning and generalization. The four instances in this thesis are on high-fidelity video prediction, video prediction that handles multi-modal data distribution, predictive model-based reinforcement learning, and model-based zero-shot policy generalization. In the first case, we use a model that disentangles motion and appearance to predict high-fidelity images. We find this method can alleviate the blurry artifact and shape deformation inherited in previous methods. In th...
While computation power has increased and the statistical machine learning methods have made substan...
We present a novel deep learning architecture for probabilistic future prediction from video. We pre...
This dissertation is concerned with the autonomous learning of behavioral models for sequential deci...
For an intelligent agent to interact with the environment efficiently, it must have the ability to p...
Predicting the future in real-world settings, particularly from raw sensory observations such as ima...
Data-efficient learning in continuous state-action spaces using very high-dimensional observations r...
For a robot to interact with its environment, it must perceive the world and understand how the worl...
Abstract — A central problem in artificial intelligence is to choose actions to maximize reward in a...
© 2020 Dmitry GrebenyukA Markov decision process (MDP) cannot be used for learning end-to-end contro...
Predicting the future state of a scene with moving objects is a task that humans handle with ease. T...
My research activity focuses on the integration of acting, learning and planning. The main objective...
People learn skills by interacting with their surroundings from the time of their birth. Reinforceme...
Building autonomous agents that learn to make predictions and take actions in sequential environment...
What is the “state” of a pile of objects? Collections of standard Lagrangian states ideally describe...
Humans can develop their internal model of the external world and use it for decision making. Reinfo...
While computation power has increased and the statistical machine learning methods have made substan...
We present a novel deep learning architecture for probabilistic future prediction from video. We pre...
This dissertation is concerned with the autonomous learning of behavioral models for sequential deci...
For an intelligent agent to interact with the environment efficiently, it must have the ability to p...
Predicting the future in real-world settings, particularly from raw sensory observations such as ima...
Data-efficient learning in continuous state-action spaces using very high-dimensional observations r...
For a robot to interact with its environment, it must perceive the world and understand how the worl...
Abstract — A central problem in artificial intelligence is to choose actions to maximize reward in a...
© 2020 Dmitry GrebenyukA Markov decision process (MDP) cannot be used for learning end-to-end contro...
Predicting the future state of a scene with moving objects is a task that humans handle with ease. T...
My research activity focuses on the integration of acting, learning and planning. The main objective...
People learn skills by interacting with their surroundings from the time of their birth. Reinforceme...
Building autonomous agents that learn to make predictions and take actions in sequential environment...
What is the “state” of a pile of objects? Collections of standard Lagrangian states ideally describe...
Humans can develop their internal model of the external world and use it for decision making. Reinfo...
While computation power has increased and the statistical machine learning methods have made substan...
We present a novel deep learning architecture for probabilistic future prediction from video. We pre...
This dissertation is concerned with the autonomous learning of behavioral models for sequential deci...