Monocular 3D object detection is a low-cost but challenging task, as it requires generating accurate 3D localization solely from a single image input. Recent developed depth-assisted methods show promising results by using explicit depth maps as intermediate features, which are either precomputed by monocular depth estimation networks or jointly evaluated with 3D object detection. However, inevitable errors from estimated depth priors may lead to misaligned semantic information and 3D localization, hence resulting in feature smearing and suboptimal predictions. To mitigate this issue, we propose ADD, an Attention-based Depth knowledge Distillation framework with 3D-aware positional encoding. Unlike previous knowledge distillation frameworks...
As an inherently ill-posed problem, depth estimation from single images is the most challenging part...
Depth represents a crucial piece of information in many practical applications, such as obstacle avo...
In the field of monocular 3D detection, it is common practice to utilize scene geometric clues to en...
Recently, the research on monocular 3D target detection based on pseudo-LiDAR data has made some pro...
Monocular depth estimation is challenging due to its inherent ambiguity and ill-posed nature, yet it...
3D object detection is a fundamental component in the autonomous driving perception pipeline. While ...
While expensive LiDAR and stereo camera rigs have enabled the development of successful 3D object de...
Monocular depth estimation enables 3D perception from a single 2D image, thus attracting much resear...
Monocular 3D object detection has long been a challenging task in autonomous driving, which requires...
Safe autonomous driving requires reliable 3D object detection-the task of estimating 3D bounding box...
Leveraging LiDAR-based detectors or real LiDAR point data to guide monocular 3D detection has brough...
Pseudo-LiDAR 3D detectors have made remarkable progress in monocular 3D detection by enhancing the c...
Estimating depth and detection of object instances in 3D space is fundamental in autonomous navigati...
Monocular 3D object detection is a crucial and challenging task for autonomous driving vehicle, whil...
The labels of monocular 3D object detection (M3OD) are expensive to obtain. Meanwhile, there usually...
As an inherently ill-posed problem, depth estimation from single images is the most challenging part...
Depth represents a crucial piece of information in many practical applications, such as obstacle avo...
In the field of monocular 3D detection, it is common practice to utilize scene geometric clues to en...
Recently, the research on monocular 3D target detection based on pseudo-LiDAR data has made some pro...
Monocular depth estimation is challenging due to its inherent ambiguity and ill-posed nature, yet it...
3D object detection is a fundamental component in the autonomous driving perception pipeline. While ...
While expensive LiDAR and stereo camera rigs have enabled the development of successful 3D object de...
Monocular depth estimation enables 3D perception from a single 2D image, thus attracting much resear...
Monocular 3D object detection has long been a challenging task in autonomous driving, which requires...
Safe autonomous driving requires reliable 3D object detection-the task of estimating 3D bounding box...
Leveraging LiDAR-based detectors or real LiDAR point data to guide monocular 3D detection has brough...
Pseudo-LiDAR 3D detectors have made remarkable progress in monocular 3D detection by enhancing the c...
Estimating depth and detection of object instances in 3D space is fundamental in autonomous navigati...
Monocular 3D object detection is a crucial and challenging task for autonomous driving vehicle, whil...
The labels of monocular 3D object detection (M3OD) are expensive to obtain. Meanwhile, there usually...
As an inherently ill-posed problem, depth estimation from single images is the most challenging part...
Depth represents a crucial piece of information in many practical applications, such as obstacle avo...
In the field of monocular 3D detection, it is common practice to utilize scene geometric clues to en...