Most modern deep learning-based multi-view 3D reconstruction techniques use RNNs or fusion modules to combine information from multiple images after independently encoding them. These two separate steps have loose connections and do not allow easy information sharing among views. We propose LegoFormer, a transformer model for voxel-based 3D reconstruction that uses the attention layers to share information among views during all computational stages. Moreover, instead of predicting each voxel independently, we propose to parametrize the output with a series of low-rank decomposition factors. This reformulation allows the prediction of an object as a set of independent regular structures then aggregated to obtain the final reconstruction. Ex...
The objective of this paper is 3D shape understanding from single and multiple images. To this end, ...
Due to the limitation of less information in a single image, it is very difficult to generate a high...
For decades, a vital area of computer vision research has been multiview stereo (MVS), which creates...
The goal of this study is to determine the effectiveness of different 3D shape representations in le...
We propose a transformer-based neural network architecture for multi-object 3D reconstruction from R...
Most of the research on automated LEGO construction focus on improving structural strength and stabi...
We study the problem of recovering an underlying 3D shape from a set of images. Existing learning ba...
We study the problem of learning generative models of 3D shapes. Voxels or 3D parts have been widely...
We study the problem of recovering an underlying 3D shape from a set of images. Existing learning ba...
We present a single-view voxel model prediction method that uses generative adversarial networks. Ou...
Our goal is to learn a deep network that, given a small number of images of an object of a given cat...
Deep learning technology has made great progress in multi-view 3D reconstruction tasks. At present, ...
Recent studies have shown that deep learning achieves excellent performance in reconstructing 3D sce...
To endow machines with the ability to perceive the real-world in a three dimensional representation ...
In this paper, we propose a novel approach, 3D-RecGAN++, which reconstructs the complete 3D structur...
The objective of this paper is 3D shape understanding from single and multiple images. To this end, ...
Due to the limitation of less information in a single image, it is very difficult to generate a high...
For decades, a vital area of computer vision research has been multiview stereo (MVS), which creates...
The goal of this study is to determine the effectiveness of different 3D shape representations in le...
We propose a transformer-based neural network architecture for multi-object 3D reconstruction from R...
Most of the research on automated LEGO construction focus on improving structural strength and stabi...
We study the problem of recovering an underlying 3D shape from a set of images. Existing learning ba...
We study the problem of learning generative models of 3D shapes. Voxels or 3D parts have been widely...
We study the problem of recovering an underlying 3D shape from a set of images. Existing learning ba...
We present a single-view voxel model prediction method that uses generative adversarial networks. Ou...
Our goal is to learn a deep network that, given a small number of images of an object of a given cat...
Deep learning technology has made great progress in multi-view 3D reconstruction tasks. At present, ...
Recent studies have shown that deep learning achieves excellent performance in reconstructing 3D sce...
To endow machines with the ability to perceive the real-world in a three dimensional representation ...
In this paper, we propose a novel approach, 3D-RecGAN++, which reconstructs the complete 3D structur...
The objective of this paper is 3D shape understanding from single and multiple images. To this end, ...
Due to the limitation of less information in a single image, it is very difficult to generate a high...
For decades, a vital area of computer vision research has been multiview stereo (MVS), which creates...