In this paper we propose an approach for monocular 3D object detection from a single RGB image, which leverages a novel disentangling transformation for 2D and 3D detection losses and a novel, self-supervised confidence score for 3D bounding boxes. Our proposed loss disentanglement has the twofold advantage of simplifying the training dynamics in the presence of losses with complex interactions of parameters, and sidestepping the issue of balancing independent regression terms. Our solution overcomes these issues by isolating the contribution made by groups of parameters to a given loss, without changing its nature. We further apply loss disentanglement to another novel, signed Intersection-over-Union criterion-driven loss for improving 2D ...
Object recognition is one of the fundamental tasks of computer vision. Recent advances in the field ...
Three-dimensional (3D) object detection is an important task in the field of machine vision, in whic...
Object recognition is one of the fundamental tasks of computer vision. Recent advances in the field ...
In this paper we introduce a method for multi-class, monocular 3D object detection from a single RGB...
In this paper we propose a novel 3D single-shot object detection method for detecting vehicles in mo...
Monocular 3D object detection task aims to predict the 3D bounding boxes of objects based on monocul...
In this work, we propose an efficient and accurate monocular 3D detection framework in single shot. ...
Estimating 3D orientation and translation of objects is essential for infrastructure-less autonomous...
Safe autonomous driving requires reliable 3D object detection-the task of estimating 3D bounding box...
Monocular 3D object detection continues to attract attention due to the cost benefits and wider avai...
Monocular 3D object detection (Mono3D) has achieved tremendous improvements with emerging large-scal...
The labels of monocular 3D object detection (M3OD) are expensive to obtain. Meanwhile, there usually...
Monocular 3D object detection has recently become prevalent in autonomous driving and navigation app...
One key task of the environment perception pipeline for autonomous driving is object detection using...
While expensive LiDAR and stereo camera rigs have enabled the development of successful 3D object de...
Object recognition is one of the fundamental tasks of computer vision. Recent advances in the field ...
Three-dimensional (3D) object detection is an important task in the field of machine vision, in whic...
Object recognition is one of the fundamental tasks of computer vision. Recent advances in the field ...
In this paper we introduce a method for multi-class, monocular 3D object detection from a single RGB...
In this paper we propose a novel 3D single-shot object detection method for detecting vehicles in mo...
Monocular 3D object detection task aims to predict the 3D bounding boxes of objects based on monocul...
In this work, we propose an efficient and accurate monocular 3D detection framework in single shot. ...
Estimating 3D orientation and translation of objects is essential for infrastructure-less autonomous...
Safe autonomous driving requires reliable 3D object detection-the task of estimating 3D bounding box...
Monocular 3D object detection continues to attract attention due to the cost benefits and wider avai...
Monocular 3D object detection (Mono3D) has achieved tremendous improvements with emerging large-scal...
The labels of monocular 3D object detection (M3OD) are expensive to obtain. Meanwhile, there usually...
Monocular 3D object detection has recently become prevalent in autonomous driving and navigation app...
One key task of the environment perception pipeline for autonomous driving is object detection using...
While expensive LiDAR and stereo camera rigs have enabled the development of successful 3D object de...
Object recognition is one of the fundamental tasks of computer vision. Recent advances in the field ...
Three-dimensional (3D) object detection is an important task in the field of machine vision, in whic...
Object recognition is one of the fundamental tasks of computer vision. Recent advances in the field ...