Homography estimation is an important step in many computer vision problems. Recently, deep neural network methods have shown to be favorable for this problem when compared to traditional methods. However, these new methods do not consider dynamic content in input images. They train neural networks with only image pairs that can be perfectly aligned using homographies. This paper investigates and discusses how to design and train a deep neural network that handles dynamic scenes. We first collect a large video dataset with dynamic content. We then develop a multi-scale neural network and show that when properly trained using our new dataset, this neural network can already handle dynamic scenes to some extent. To estimate a homography of a ...
© 2016 NIPS Foundation - All Rights Reserved. In a traditional convolutional layer, the learned filt...
A collection of images of a scene captured from different perspectives inform us about the scene's c...
When a planar structure is observed from multiple views, the projections of its corresponding 3D poi...
Homography estimation of infrared and visible images is a highly challenging task in computer vision...
The standard approach to the estimation of homographies consists in the application of the RANSAC al...
This thesis proposes an optimized convolutional neural network architecture to improve homography es...
Most classic approaches to homography estimation are based on the filtering of outliers by means of ...
Homography estimation is a fundamental task in many computer vision applications, but many technique...
Homography is an important area of computer vision for scene understanding and plays a key role in e...
Homography estimation is erroneous in the case of large-baseline due to the low image overlay and li...
Planar homography estimation refers to the problem of computing a bijective linear mapping of pixels...
Recent works have shown that deep learning methods can improve the performance of the homography est...
Sequential data such as video are characterized by spatio-temporal redundancies. As of yet, few deep...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
Sequential data such as video are characterized by spatio-temporal redundancies. As of yet, few deep...
© 2016 NIPS Foundation - All Rights Reserved. In a traditional convolutional layer, the learned filt...
A collection of images of a scene captured from different perspectives inform us about the scene's c...
When a planar structure is observed from multiple views, the projections of its corresponding 3D poi...
Homography estimation of infrared and visible images is a highly challenging task in computer vision...
The standard approach to the estimation of homographies consists in the application of the RANSAC al...
This thesis proposes an optimized convolutional neural network architecture to improve homography es...
Most classic approaches to homography estimation are based on the filtering of outliers by means of ...
Homography estimation is a fundamental task in many computer vision applications, but many technique...
Homography is an important area of computer vision for scene understanding and plays a key role in e...
Homography estimation is erroneous in the case of large-baseline due to the low image overlay and li...
Planar homography estimation refers to the problem of computing a bijective linear mapping of pixels...
Recent works have shown that deep learning methods can improve the performance of the homography est...
Sequential data such as video are characterized by spatio-temporal redundancies. As of yet, few deep...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
Sequential data such as video are characterized by spatio-temporal redundancies. As of yet, few deep...
© 2016 NIPS Foundation - All Rights Reserved. In a traditional convolutional layer, the learned filt...
A collection of images of a scene captured from different perspectives inform us about the scene's c...
When a planar structure is observed from multiple views, the projections of its corresponding 3D poi...