Weakly supervised object detection (WSOD) enables object detectors to be trained using only image-level class labels. However, the practical application of current WSOD models is limited, as they operate at small scales and require extensive training and refinement. We propose the Weakly Supervised Detection Transformer, which enables efficient knowledge transfer from a large-scale pretraining dataset to WSOD finetuning on hundreds of novel objects. We leverage pretrained knowledge to improve the multiple instance learning framework used in WSOD, and experiments show that our approach outperforms the state of the art on datasets with twice as many novel classes as previously shown. Comment: CVPR 2022 Workshop on Attention and Transformers in Visio
Weakly supervised learning of object detection is an important problem in image understanding that s...
Drones are employed in a growing number of visual recognition applications. A recent development in ...
Weakly supervised object detection (WSOD) has attracted more and more attention since it only uses i...
Weakly Supervised Object Detection (WSOD) is a task that detects objects in an image using a model t...
A major challenge that arises in Weakly Supervised Object Detection (WSOD) is that only ...
More and more datasets have increased their size with enough class annotations. Although the classif...
Object detection in images and videos is an important topic in computer vision. In general, a large ...
Semi- and weakly-supervised learning have recently attracted considerable attention in the object de...
Weakly supervised object detection (WSOD) has recently attracted much attention. However, the lack o...
Weakly supervised object detection (WSOD) is a challenging task that requires simultaneously learn o...
Combining simple architectures with large-scale pre-training has led to massive improvements in imag...
Object detection is closely related to video and image analysis. Under computer vision technology,...
In this paper, we consider the problem of leveraging existing fully labeled categories to improve th...
Object detection (OD), a crucial vision task, remains challenged by the lack of large training datas...
Weakly supervised object detection (WSOD) is an important issue in vision tasks. Unlike fully superv...