We present ATLAS-MVSNet, an end-to-end deep learning architecture relying on local attention layers for depth map inference from multi-view images. Distinct from existing works, we introduce a novel module design for neural networks, which we termed hybrid attention block, that utilizes the latest insights into attention in vision models. We are able to reap the benefits of attention in both, the carefully designed multi-stage feature extraction network and the cost volume regularization network. Our new approach displays significant improvement over its counterpart based purely on convolutions. While many state-of-the-art methods need multiple high-end GPUs in the training phase, we are able to train our network on a single consumer grade ...
Deep learning-based methods have made remarkable progress for stereo matching in terms of accuracy. ...
Over the years, learning-based multi-view stereo methods have achieved great success based on their ...
Estimating depth from images has become a very popular task in computer vision which aims to restore...
Abstract Deep learning has recently been proven to deliver excellent performance in multi-view stere...
We propose an efficient multi-view stereo (MVS) network for infering depth value from multiple RGB i...
We propose an end-to-end deep learning architecture for 3D reconstruction from high-resolution image...
We present DeepMVS, a deep convolutional neural network (ConvNet) for multi-view stereo reconstructi...
Building extraction from very high resolution (VHR) imagery plays an important role in urban plannin...
Stereo reconstruction is a problem of recovering a 3d structure of a scene from a pair of images of ...
The learning‐based multiview stereo (MVS) methods for three‐dimensional (3D) reconstruction generall...
In this work we train in an end-to-end manner a convolutional neural network (CNN) that jointly hand...
We propose a novel technique to incorporate attention within convolutional neural networks using fea...
In this paper, we present a learning-based approach for multi-view stereo (MVS), i.e., estimate the ...
With the successful development in computer vision, building a deep convolutional neural network (CN...
We propose a novel lightweight network for stereo estimation. Our network consists of a fully-convol...
Deep learning-based methods have made remarkable progress for stereo matching in terms of accuracy. ...
Over the years, learning-based multi-view stereo methods have achieved great success based on their ...
Estimating depth from images has become a very popular task in computer vision which aims to restore...
Abstract Deep learning has recently been proven to deliver excellent performance in multi-view stere...
We propose an efficient multi-view stereo (MVS) network for infering depth value from multiple RGB i...
We propose an end-to-end deep learning architecture for 3D reconstruction from high-resolution image...
We present DeepMVS, a deep convolutional neural network (ConvNet) for multi-view stereo reconstructi...
Building extraction from very high resolution (VHR) imagery plays an important role in urban plannin...
Stereo reconstruction is a problem of recovering a 3d structure of a scene from a pair of images of ...
The learning‐based multiview stereo (MVS) methods for three‐dimensional (3D) reconstruction generall...
In this work we train in an end-to-end manner a convolutional neural network (CNN) that jointly hand...
We propose a novel technique to incorporate attention within convolutional neural networks using fea...
In this paper, we present a learning-based approach for multi-view stereo (MVS), i.e., estimate the ...
With the successful development in computer vision, building a deep convolutional neural network (CN...
We propose a novel lightweight network for stereo estimation. Our network consists of a fully-convol...
Deep learning-based methods have made remarkable progress for stereo matching in terms of accuracy. ...
Over the years, learning-based multi-view stereo methods have achieved great success based on their ...
Estimating depth from images has become a very popular task in computer vision which aims to restore...