Current object detectors are limited in vocabulary size due to the small scale of detection datasets. Image classifiers, on the other hand, reason about much larger vocabularies, as their datasets are larger and easier to collect. We propose Detic, which simply trains the classifiers of a detector on image classification data and thus expands the vocabulary of detectors to tens of thousands of concepts. Unlike prior work, Detic does not need complex assignment schemes to assign image labels to boxes based on model predictions, making it much easier to implement and compatible with a range of detection architectures and backbones. Our results show that Detic yields excellent detectors even for classes without box annotations. It outperforms ...
Scalability of object detectors with respect to the number of classes is a very important issue for ...
DETR-based object detectors have achieved remarkable performance but are sample-inefficient and exhi...
Object Detection is the task of classification andlocalization of objects in an image or video. It h...
Object detection is a fundamental computer vision task that estimates object classification labels a...
A major challenge in scaling object detection is the difficulty of obtaining la-beled images for lar...
International audienceWeakly-supervised object detection attempts to limit the amount of supervision...
We address the problem of training Object Detection models using significantly less bounding box ann...
More and more datasets have increased their size with enough class annotations. Although the classif...
We investigate the use of deep neural networks for the novel task of class-generic object detec-tion...
This thesis concerns the problem of object detection, which is defined as finding all instances of a...
Open-vocabulary object detection, which is concerned with the problem of detecting novel objects gui...
Abstract. Training an object class detector typically requires a large set of im-ages annotated with...
Open-set object detection aims at detecting arbitrary categories beyond those seen during training. ...
Weakly supervised learning of object detection is an important problem in image understanding that s...
Abstract. Viola and Jones [VJ] demonstrate that cascade classification methods can successfully dete...
Scalability of object detectors with respect to the number of classes is a very important issue for ...
DETR-based object detectors have achieved remarkable performance but are sample-inefficient and exhi...
Object Detection is the task of classification andlocalization of objects in an image or video. It h...
Object detection is a fundamental computer vision task that estimates object classification labels a...
A major challenge in scaling object detection is the difficulty of obtaining la-beled images for lar...
International audienceWeakly-supervised object detection attempts to limit the amount of supervision...
We address the problem of training Object Detection models using significantly less bounding box ann...
More and more datasets have increased their size with enough class annotations. Although the classif...
We investigate the use of deep neural networks for the novel task of class-generic object detec-tion...
This thesis concerns the problem of object detection, which is defined as finding all instances of a...
Open-vocabulary object detection, which is concerned with the problem of detecting novel objects gui...
Abstract. Training an object class detector typically requires a large set of im-ages annotated with...
Open-set object detection aims at detecting arbitrary categories beyond those seen during training. ...
Weakly supervised learning of object detection is an important problem in image understanding that s...
Abstract. Viola and Jones [VJ] demonstrate that cascade classification methods can successfully dete...
Scalability of object detectors with respect to the number of classes is a very important issue for ...
DETR-based object detectors have achieved remarkable performance but are sample-inefficient and exhi...
Object Detection is the task of classification andlocalization of objects in an image or video. It h...