Recent work has demonstrated that it is possible to learn deep neural networks for monocular depth and ego-motion estimation from unlabelled video sequences, an interesting theoretical development with numerous advantages in applications. In this paper, we propose a number of improvements to these approaches. First, since such self-supervised approaches are based on the brightness constancy assumption, which is valid only for a subset of pixels, we propose a probabilistic learning formulation where the network predicts distributions over variables rather than specific values. As these distributions are conditioned on the observed image, the network can learn which scene and object types are likely to violate the model assumptions, resulting...
The best way to combine the results of deep learning with standard 3D reconstruction pipelines remai...
Deployment of deep neural networks (DNNs) for monocular depth estimation in safety-critical scenari...
Self-supervised monocular depth estimation networks are trained to predict scene depth using nearby ...
Recent work has demonstrated that it is possible to learn deep neural networks for monocular depth a...
Learning to predict scene depth from RGB inputs is a challenging task both for indoor and outdoor ro...
Estimating a dense depth map from a single view is geometrically ill-posed, and state-of-the-art met...
none7noWhole understanding of the surroundings is paramount to autonomous systems. Recent works have...
This paper proposes a novel unsupervised learning framework for depth recovery and camera ego-motion...
Reconstruction happens in the human brain every day. When humans watch their surrounding scene, they...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Structure-from-Motion (SfM) using the frames of a video sequence can be a challenging task because t...
We propose a novel method for learning convolutional neural image representations without manual sup...
We propose a monocular depth estimation method SC-Depth, which requires only unlabelled videos for t...
The recent rise in machine learning has been largely made possible by novel algorithms, such as con...
Despite well-established baselines, learning of scene depth and ego-motion from monocular video rema...
The best way to combine the results of deep learning with standard 3D reconstruction pipelines remai...
Deployment of deep neural networks (DNNs) for monocular depth estimation in safety-critical scenari...
Self-supervised monocular depth estimation networks are trained to predict scene depth using nearby ...
Recent work has demonstrated that it is possible to learn deep neural networks for monocular depth a...
Learning to predict scene depth from RGB inputs is a challenging task both for indoor and outdoor ro...
Estimating a dense depth map from a single view is geometrically ill-posed, and state-of-the-art met...
none7noWhole understanding of the surroundings is paramount to autonomous systems. Recent works have...
This paper proposes a novel unsupervised learning framework for depth recovery and camera ego-motion...
Reconstruction happens in the human brain every day. When humans watch their surrounding scene, they...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Structure-from-Motion (SfM) using the frames of a video sequence can be a challenging task because t...
We propose a novel method for learning convolutional neural image representations without manual sup...
We propose a monocular depth estimation method SC-Depth, which requires only unlabelled videos for t...
The recent rise in machine learning has been largely made possible by novel algorithms, such as con...
Despite well-established baselines, learning of scene depth and ego-motion from monocular video rema...
The best way to combine the results of deep learning with standard 3D reconstruction pipelines remai...
Deployment of deep neural networks (DNNs) for monocular depth estimation in safety-critical scenari...
Self-supervised monocular depth estimation networks are trained to predict scene depth using nearby ...