We address the task of open-world class-agnostic object detection, i.e., detecting every object in an image by learning from a limited number of base object classes. State-of-the-art RGB-based models suffer from overfitting the training classes and often fail at detecting novel-looking objects. This is because RGB-based models primarily rely on appearance similarity to detect novel objects and are also prone to overfitting short-cut cues such as textures and discriminative parts. To address these shortcomings of RGB-based object detectors, we propose incorporating geometric cues such as depth and normals, predicted by general-purpose monocular estimators. Specifically, we use the geometric cues to train an object proposal network for pseudo...
Object detection, which aims to recognize and locate objects within images using bounding boxes, is ...
We present a list of datasets and their best models with the goal of advancing the state-of-the-art ...
Building robust and generic object detection frameworks requires scaling to larger label spaces and ...
Focusing on the issue of how to effectively capture and utilize cross-modality information in RGB-D ...
Although modern object detectors rely heavily on a significant amount of training data, humans can e...
The task of finding objects belonging to classes of interest in images has long been a focus of Comp...
The performance of object detection has steadily improved over the past decade, primarily due to imp...
Motivation: Current work on visual object recognition focuses on object classification and is implic...
3D object detection has achieved significant performance in many fields, e.g., robotics system, auto...
This paper introduces the Budding Ensemble Architecture (BEA), a novel reduced ensemble architecture...
Despite great progress in object detection, most existing methods work only on a limited set of obje...
Object detection is a fundamental computer vision task that estimates object classification labels a...
Object recognition systems today see the world as a collection of object categories, each existing a...
As an inherently ill-posed problem, depth estimation from single images is the most challenging part...
Open-set object detection aims at detecting arbitrary categories beyond those seen during training. ...
Object detection, which aims to recognize and locate objects within images using bounding boxes, is ...
We present a list of datasets and their best models with the goal of advancing the state-of-the-art ...
Building robust and generic object detection frameworks requires scaling to larger label spaces and ...
Focusing on the issue of how to effectively capture and utilize cross-modality information in RGB-D ...
Although modern object detectors rely heavily on a significant amount of training data, humans can e...
The task of finding objects belonging to classes of interest in images has long been a focus of Comp...
The performance of object detection has steadily improved over the past decade, primarily due to imp...
Motivation: Current work on visual object recognition focuses on object classification and is implic...
3D object detection has achieved significant performance in many fields, e.g., robotics system, auto...
This paper introduces the Budding Ensemble Architecture (BEA), a novel reduced ensemble architecture...
Despite great progress in object detection, most existing methods work only on a limited set of obje...
Object detection is a fundamental computer vision task that estimates object classification labels a...
Object recognition systems today see the world as a collection of object categories, each existing a...
As an inherently ill-posed problem, depth estimation from single images is the most challenging part...
Open-set object detection aims at detecting arbitrary categories beyond those seen during training. ...
Object detection, which aims to recognize and locate objects within images using bounding boxes, is ...
We present a list of datasets and their best models with the goal of advancing the state-of-the-art ...
Building robust and generic object detection frameworks requires scaling to larger label spaces and ...