We propose DiffuStereo, a novel system using only sparse cameras (8 in this work) for high-quality 3D human reconstruction. At its core is a novel diffusion-based stereo module, which introduces diffusion models, a type of powerful generative models, into the iterative stereo matching network. To this end, we design a new diffusion kernel and additional stereo constraints to facilitate stereo matching and depth estimation in the network. We further present a multi-level stereo network architecture to handle high-resolution (up to 4k) inputs without requiring unaffordable memory footprint. Given a set of sparse-view color images of a human, the proposed multi-level diffusion-based stereo network can produce highly accurate depth maps, which ...
none5siIn many fields, self-supervised learning solutions are rapidly evolving and filling the gap w...
International audienceThis paper proposes a learning based solution to disparity (depth) estimation ...
none5siStereo matching is one of the longest-standing problems in computer vision with close to 40 y...
We present an innovative approach to 3D Human Pose Estimation (3D-HPE) by integrating cutting-edge d...
We present an approach to generate a 360-degree view of a person with a consistent, high-resolution ...
We present RGB-D-Fusion, a multi-modal conditional denoising diffusion probabilistic model to genera...
Diffusion models have emerged as the new state-of-the-art generative model with high quality samples...
This paper proposes a novel method for depth completion, which leverages multi-view improved monitor...
We present an approach to generating 3D human models from images. The key to our framework is that w...
International audienceThis paper addresses the problem of range-stereo fusion, for the construction ...
Stereo reconstruction is a problem of recovering a 3d structure of a scene from a pair of images of ...
Purpose: Our aim is obtaining a dense and reliable disparity map from a stereoscopic pair under any...
Existing deep calibrated photometric stereo networks basically aggregate observations under differen...
Depth estimation from stereo images remains a challenge even though studied for decades. The KITTI b...
In this paper, we introduce HDhuman, a method that addresses the challenge of novel view rendering o...
none5siIn many fields, self-supervised learning solutions are rapidly evolving and filling the gap w...
International audienceThis paper proposes a learning based solution to disparity (depth) estimation ...
none5siStereo matching is one of the longest-standing problems in computer vision with close to 40 y...
We present an innovative approach to 3D Human Pose Estimation (3D-HPE) by integrating cutting-edge d...
We present an approach to generate a 360-degree view of a person with a consistent, high-resolution ...
We present RGB-D-Fusion, a multi-modal conditional denoising diffusion probabilistic model to genera...
Diffusion models have emerged as the new state-of-the-art generative model with high quality samples...
This paper proposes a novel method for depth completion, which leverages multi-view improved monitor...
We present an approach to generating 3D human models from images. The key to our framework is that w...
International audienceThis paper addresses the problem of range-stereo fusion, for the construction ...
Stereo reconstruction is a problem of recovering a 3d structure of a scene from a pair of images of ...
Purpose: Our aim is obtaining a dense and reliable disparity map from a stereoscopic pair under any...
Existing deep calibrated photometric stereo networks basically aggregate observations under differen...
Depth estimation from stereo images remains a challenge even though studied for decades. The KITTI b...
In this paper, we introduce HDhuman, a method that addresses the challenge of novel view rendering o...
none5siIn many fields, self-supervised learning solutions are rapidly evolving and filling the gap w...
International audienceThis paper proposes a learning based solution to disparity (depth) estimation ...
none5siStereo matching is one of the longest-standing problems in computer vision with close to 40 y...