Modern top-performing object detectors depend heavily on backbone networks, whose advances bring consistent performance gains through exploring more effective network structures. In this paper, we propose a novel and flexible backbone framework, namely CBNetV2, to construct high-performance detectors using existing open-sourced pre-trained backbones under the pre-training fine-tuning paradigm. In particular, CBNetV2 architecture groups multiple identical backbones, which are connected through composite connections. Specifically, it integrates the high- and low-level features of multiple backbone networks and gradually expands the receptive field to more efficiently perform object detection. We also propose a better training strategy with as...
Human detection is a special application of object recognition and is considered one of the greatest...
Object Detection is the task of classification andlocalization of objects in an image or video. It h...
More and more datasets have increased their size with enough class annotations. Although the classif...
In existing CNN based detectors, the backbone network is a very important component for basic featur...
Deep Convolutional Neural Networks (CNNs) have induced significant progress in the field of computer...
We explore the plain, non-hierarchical Vision Transformer (ViT) as a backbone network for object det...
A common practice in transfer learning is to initialize the downstream model weights by pre-training...
As the object detection dataset scale is smaller than the image recognition dataset ImageNet scale, ...
Feature pyramids have become ubiquitous in multi-scale computer vision tasks such as object detectio...
We address the challenge of training a large supernet for the object detection task, using a relativ...
In object detection, the detection backbone consumes more than half of the overall inference cost. R...
Human detection is an important task in computer vision. It is one of the most important tasks in gl...
Recently, Neural architecture search has achieved great success on classification tasks for mobile d...
Modern object detectors can rarely achieve short training time, fast inference speed, and high accur...
There has been a rising interest in running high-quality Convolutional Neural Network (CNN) models u...
Human detection is a special application of object recognition and is considered one of the greatest...
Object Detection is the task of classification andlocalization of objects in an image or video. It h...
More and more datasets have increased their size with enough class annotations. Although the classif...
In existing CNN based detectors, the backbone network is a very important component for basic featur...
Deep Convolutional Neural Networks (CNNs) have induced significant progress in the field of computer...
We explore the plain, non-hierarchical Vision Transformer (ViT) as a backbone network for object det...
A common practice in transfer learning is to initialize the downstream model weights by pre-training...
As the object detection dataset scale is smaller than the image recognition dataset ImageNet scale, ...
Feature pyramids have become ubiquitous in multi-scale computer vision tasks such as object detectio...
We address the challenge of training a large supernet for the object detection task, using a relativ...
In object detection, the detection backbone consumes more than half of the overall inference cost. R...
Human detection is an important task in computer vision. It is one of the most important tasks in gl...
Recently, Neural architecture search has achieved great success on classification tasks for mobile d...
Modern object detectors can rarely achieve short training time, fast inference speed, and high accur...
There has been a rising interest in running high-quality Convolutional Neural Network (CNN) models u...
Human detection is a special application of object recognition and is considered one of the greatest...
Object Detection is the task of classification andlocalization of objects in an image or video. It h...
More and more datasets have increased their size with enough class annotations. Although the classif...