We propose \textbf{DMV3D}, a novel 3D generation approach that uses a transformer-based 3D large reconstruction model to denoise multi-view diffusion. Our reconstruction model incorporates a triplane NeRF representation and can denoise noisy multi-view images via NeRF reconstruction and rendering, achieving single-stage 3D generation in $\sim$30s on single A100 GPU. We train \textbf{DMV3D} on large-scale multi-view image datasets of highly diverse objects using only image reconstruction losses, without accessing 3D assets. We demonstrate state-of-the-art results for the single-image reconstruction problem where probabilistic modeling of unseen object parts is required for generating diverse reconstructions with sharp textures. We also show ...
We present an approach to generate a 360-degree view of a person with a consistent, high-resolution ...
Recent progress in 3D scene understanding enables scalable learning of representations across large ...
Currently, applying diffusion models in pixel space of high resolution images is difficult. Instead,...
We present Viewset Diffusion, a diffusion-based generator that outputs 3D objects while only using m...
We propose MVDream, a multi-view diffusion model that is able to generate geometrically consistent m...
We present 3DiM, a diffusion model for 3D novel view synthesis, which is able to translate a single ...
Single-view novel view synthesis, the task of generating images from new viewpoints based on a singl...
In the last years, Denoising Diffusion Probabilistic Models (DDPMs) obtained state-of-the-art result...
We propose an effective denoising diffusion model for generating high-resolution images (e.g., 1024$...
Semantic-driven 3D shape generation aims to generate 3D objects conditioned on text. Previous works ...
Denoising diffusion models (DDMs) have led to staggering performance leaps in image generation, edit...
Diffusion models generating images conditionally on text, such as Dall-E 2 and Stable Diffusion, hav...
Diffusion models have emerged as the new state-of-the-art generative model with high quality samples...
Material reconstruction from a photograph is a key component of 3D content creation democratization....
Denoising diffusion probabilistic models (DDPMs) have been proven capable of synthesizing high-quali...
We present an approach to generate a 360-degree view of a person with a consistent, high-resolution ...
Recent progress in 3D scene understanding enables scalable learning of representations across large ...
Currently, applying diffusion models in pixel space of high resolution images is difficult. Instead,...
We present Viewset Diffusion, a diffusion-based generator that outputs 3D objects while only using m...
We propose MVDream, a multi-view diffusion model that is able to generate geometrically consistent m...
We present 3DiM, a diffusion model for 3D novel view synthesis, which is able to translate a single ...
Single-view novel view synthesis, the task of generating images from new viewpoints based on a singl...
In the last years, Denoising Diffusion Probabilistic Models (DDPMs) obtained state-of-the-art result...
We propose an effective denoising diffusion model for generating high-resolution images (e.g., 1024$...
Semantic-driven 3D shape generation aims to generate 3D objects conditioned on text. Previous works ...
Denoising diffusion models (DDMs) have led to staggering performance leaps in image generation, edit...
Diffusion models generating images conditionally on text, such as Dall-E 2 and Stable Diffusion, hav...
Diffusion models have emerged as the new state-of-the-art generative model with high quality samples...
Material reconstruction from a photograph is a key component of 3D content creation democratization....
Denoising diffusion probabilistic models (DDPMs) have been proven capable of synthesizing high-quali...
We present an approach to generate a 360-degree view of a person with a consistent, high-resolution ...
Recent progress in 3D scene understanding enables scalable learning of representations across large ...
Currently, applying diffusion models in pixel space of high resolution images is difficult. Instead,...