Multiple object detection and pose estimation are vital computer vision tasks. The latter relates to the former as a downstream problem in applications such as robotics and autonomous driving. However, due to the high complexity of both tasks, existing methods generally treat them independently, which is sub-optimal. We propose simultaneous neural modeling of both using monocular vision and 3D model infusion. Our Simultaneous Multiple Object detection and Pose Estimation network (SMOPE-Net) is an end-to-end trainable multitasking network with a composite loss that also provides the advantages of anchor-free detections for efficient downstream pose estimation. To enable the annotation of training data for our learning objective, we develop a...
Simultaneous object recognition and pose estimation are two key functionalities for robots to safely...
Visual odometry is the process of estimating incremental localization of the camera in 3-dimensional...
Pose estimation of 3D objects in monocular images is a fundamental and long-standing problem in comp...
We introduce a scalable approach for object pose estima-tion trained on simulated RGB views of multi...
In computer vision pose estimation of objects in everyday scenes is a basic need for a clearundersta...
Learning general image representations has proven key to the success of many computer vision tasks. ...
In this work, we present, LieNet, a novel deep learning framework that simultaneously detects, segme...
Applications in the field of augmented reality or robotics often require joint localisation and 6D p...
In this survey we present a complete landscape of joint object detection and pose estimation methods...
Numerous ‘non-maximum suppression’ (NMS) post-processing schemes have been proposed for merging mult...
Accurate 3D human pose estimation from single images is possible with sophisticated deep-net archite...
Precise localization in the form of 6-DoF pose estimation is of great value in traffic scenario res...
Simultaneous object recognition and pose estimation are two key functionalities for robots to safely...
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
A new learning strategy for object detection is presented. The proposed scheme forgoes the need to t...
Simultaneous object recognition and pose estimation are two key functionalities for robots to safely...
Visual odometry is the process of estimating incremental localization of the camera in 3-dimensional...
Pose estimation of 3D objects in monocular images is a fundamental and long-standing problem in comp...
We introduce a scalable approach for object pose estima-tion trained on simulated RGB views of multi...
In computer vision pose estimation of objects in everyday scenes is a basic need for a clearundersta...
Learning general image representations has proven key to the success of many computer vision tasks. ...
In this work, we present, LieNet, a novel deep learning framework that simultaneously detects, segme...
Applications in the field of augmented reality or robotics often require joint localisation and 6D p...
In this survey we present a complete landscape of joint object detection and pose estimation methods...
Numerous ‘non-maximum suppression’ (NMS) post-processing schemes have been proposed for merging mult...
Accurate 3D human pose estimation from single images is possible with sophisticated deep-net archite...
Precise localization in the form of 6-DoF pose estimation is of great value in traffic scenario res...
Simultaneous object recognition and pose estimation are two key functionalities for robots to safely...
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
A new learning strategy for object detection is presented. The proposed scheme forgoes the need to t...
Simultaneous object recognition and pose estimation are two key functionalities for robots to safely...
Visual odometry is the process of estimating incremental localization of the camera in 3-dimensional...
Pose estimation of 3D objects in monocular images is a fundamental and long-standing problem in comp...