In this work, we address the challenging task of 3D object recognition without the reliance on real-world 3D labeled data. Our goal is to predict the 3D shape, size, and 6D pose of objects within a single RGB-D image, operating at the category level and eliminating the need for CAD models during inference. While existing self-supervised methods have made strides in this field, they often suffer from inefficiencies arising from non-end-to-end processing, reliance on separate models for different object categories, and slow surface extraction during the training of implicit reconstruction models; thus hindering both the speed and real-world applicability of the 3D recognition process. Our proposed method leverages a multi-stage training pipel...
Visual perception tasks often require vast amounts of labelled data, including 3D poses and image sp...
Tracking an object's 6D pose, while either the object itself or the observing camera is moving, is i...
Mining object-level knowledge, that is, building a comprehensive category model base, from a large s...
Scalable 6D pose estimation for rigid objects from RGB images aims at handling multiple objects and ...
This paper presents 6D-ViT, a transformer-based instance representation learning network, which is s...
It is difficult to precisely annotate object instances and their semantics in 3D space, and as such,...
To enable meaningful robotic manipulation of objects in the real-world, 6D pose estimation is one of...
We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D ...
Recent works on 6D object pose estimation focus on learning keypoint correspondences between images ...
Applications in the field of augmented reality or robotics often require joint localisation and 6D p...
Most recent 6D object pose estimation methods, including unsupervised ones, require many real traini...
While showing promising results, recent RGB-D camera-based category-level object pose estimation met...
Recently, RGBD-based category-level 6D object pose estimation has achieved promising improvement in ...
The performance of existing single-view 3D reconstruction methods heavily relies on large-scale 3D a...
We propose a single-shot approach for simultaneously detecting an object in an RGB image and predict...
Visual perception tasks often require vast amounts of labelled data, including 3D poses and image sp...
Tracking an object's 6D pose, while either the object itself or the observing camera is moving, is i...
Mining object-level knowledge, that is, building a comprehensive category model base, from a large s...
Scalable 6D pose estimation for rigid objects from RGB images aims at handling multiple objects and ...
This paper presents 6D-ViT, a transformer-based instance representation learning network, which is s...
It is difficult to precisely annotate object instances and their semantics in 3D space, and as such,...
To enable meaningful robotic manipulation of objects in the real-world, 6D pose estimation is one of...
We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D ...
Recent works on 6D object pose estimation focus on learning keypoint correspondences between images ...
Applications in the field of augmented reality or robotics often require joint localisation and 6D p...
Most recent 6D object pose estimation methods, including unsupervised ones, require many real traini...
While showing promising results, recent RGB-D camera-based category-level object pose estimation met...
Recently, RGBD-based category-level 6D object pose estimation has achieved promising improvement in ...
The performance of existing single-view 3D reconstruction methods heavily relies on large-scale 3D a...
We propose a single-shot approach for simultaneously detecting an object in an RGB image and predict...
Visual perception tasks often require vast amounts of labelled data, including 3D poses and image sp...
Tracking an object's 6D pose, while either the object itself or the observing camera is moving, is i...
Mining object-level knowledge, that is, building a comprehensive category model base, from a large s...