We present an approach to generating 3D human models from images. The key to our framework is predicting double-sided orthographic depth maps and color images from a single perspective-projected image. Our framework consists of three networks. The first network predicts normal maps to recover geometric details such as wrinkles in clothing and facial regions. The second network predicts shade-removed images for the front and back views using the predicted normal maps. The last, multi-headed network takes both the normal maps and the shade-free images and predicts depth maps, selectively fusing photometric and geometric information through multi-headed attention gates. Experimental results demonstrate that our method shows visually...
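One reason double-sided orthographic depth maps are a convenient output is that they can be lifted to 3D without any camera intrinsics: each foreground pixel maps to a fixed (x, y) position and the two depth values give the front and back surface points. The following is a minimal sketch of that lifting step only, not the method from the abstract. The function name, the `mask` and `pixel_size` parameters, and the convention that both maps store z along the same axis are assumptions made for illustration.

```python
import numpy as np


def orthographic_depths_to_points(front_depth, back_depth, mask, pixel_size=1.0):
    """Lift front/back orthographic depth maps of shape (H, W) to a point cloud.

    Hypothetical helper: assumes both maps store z along the same viewing axis
    and that `mask` marks foreground (person) pixels.
    Returns an array of shape (2 * N, 3), front points first, then back points.
    """
    h, w = front_depth.shape
    v, u = np.nonzero(mask)  # row (v) and column (u) indices of foreground pixels

    # Orthographic projection: x and y depend only on the pixel position,
    # not on depth. The image centre is taken as the origin, with y pointing up.
    x = (u - (w - 1) / 2.0) * pixel_size
    y = ((h - 1) / 2.0 - v) * pixel_size

    front = np.stack([x, y, front_depth[v, u]], axis=1)
    back = np.stack([x, y, back_depth[v, u]], axis=1)
    return np.concatenate([front, back], axis=0)


if __name__ == "__main__":
    front = np.full((2, 2), 1.0)
    back = np.full((2, 2), 3.0)
    mask = np.ones((2, 2), dtype=bool)
    points = orthographic_depths_to_points(front, back, mask)
    print(points.shape)
```

With a perspective depth map, the same step would additionally require the focal length and principal point, because x and y would scale with depth.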
We propose a novel monocular depth estimator, which improves the prediction accuracy on human region...
Monocular image-based 3D reconstruction of faces is a long-standing problem in computer vision. Sinc...
Human motion capture either requires multi-camera systems or is unreliable using single-view input d...
In this paper, we revisit the problem of 3D human modeling from two orthogonal silhouettes of indivi...
We present an approach to generate a 360-degree view of a person with a consistent, high-resolution ...
In this work, we present a new method for 3D face reconstruction from sparse-view RGB images. Unlike...
We present 3DHumanGAN, a 3D-aware generative adversarial network that synthesizes photorealistic ima...
High-fidelity 3D scene reconstruction from monocular videos continues to be challenging, especially ...
We propose DiffuStereo, a novel system using only sparse cameras (8 in this work) for high-quality 3...
In this work we address the problem of estimating 3D human pose from a single ...
Obtaining personalized 3D animatable avatars from a monocular camera has several real world applicat...
In this paper, an adversarial architecture for facial depth map estimation from monocular intensity ...
3D reconstruction from images now plays an important role in computer vision with many im...
Many mobile manufacturers have recently adopted Dual-Pixel (DP) sensors in their flagship models for...
Human figures frequently occur on pictorial maps besides other illustrative entities. In this work, ...