Modern detection transformers (DETRs) use a set of object queries to predict a list of bounding boxes, sort them by their classification confidence scores, and select the top-ranked predictions as the final detection results for the given input image. A highly performant object detector requires accurate ranking for the bounding box predictions. For DETR-based detectors, the top-ranked bounding boxes suffer from less accurate localization quality due to the misalignment between classification scores and localization accuracy, thus impeding the construction of high-quality detectors. In this work, we introduce a simple and highly performant DETR-based object detector by proposing a series of rank-oriented designs, combinedly called Rank-DETR...
Ranking hypothesis sets is a powerful concept for efficient object detection. In this work, we propo...
DETR-based object detectors have achieved remarkable performance but are sample-inefficient and exhi...
We present a strong object detector with encoder-decoder pretraining and finetuning. Our method, cal...
In this paper, we are interested in Detection Transformer (DETR), an end-to-end object detection app...
Detection Transformer (DETR) directly transforms queries to unique objects by using one-to-one bipar...
We present in this paper a novel denoising training method to speedup DETR (DEtection TRansformer) t...
Transformer-based object detectors (DETR) have shown significant performance across machine vision t...
Accurately ranking the vast number of candidate detections is crucial for dense object detectors to ...
Motivated by that DETR-based approaches have established new records on COCO detection and segmentat...
One-stage object detectors are trained by optimizing classification-loss and localization-loss simul...
The task of out-of-distribution (OOD) detection is crucial for deploying machine learning models in ...
Ranking hypothesis sets is a powerful concept for efficient object detection. In this work, we propo...
Detection transformer (DETR) relies on one-to-one assignment, assigning one ground-truth object to o...
Convolutional Neural Networks (CNN) have dominated the field of detection ever since the success of ...
Human-Object Interaction (HOI) detection is a core task for high-level image understanding. Recently...
Ranking hypothesis sets is a powerful concept for efficient object detection. In this work, we propo...
DETR-based object detectors have achieved remarkable performance but are sample-inefficient and exhi...
We present a strong object detector with encoder-decoder pretraining and finetuning. Our method, cal...
In this paper, we are interested in Detection Transformer (DETR), an end-to-end object detection app...
Detection Transformer (DETR) directly transforms queries to unique objects by using one-to-one bipar...
We present in this paper a novel denoising training method to speedup DETR (DEtection TRansformer) t...
Transformer-based object detectors (DETR) have shown significant performance across machine vision t...
Accurately ranking the vast number of candidate detections is crucial for dense object detectors to ...
Motivated by that DETR-based approaches have established new records on COCO detection and segmentat...
One-stage object detectors are trained by optimizing classification-loss and localization-loss simul...
The task of out-of-distribution (OOD) detection is crucial for deploying machine learning models in ...
Ranking hypothesis sets is a powerful concept for efficient object detection. In this work, we propo...
Detection transformer (DETR) relies on one-to-one assignment, assigning one ground-truth object to o...
Convolutional Neural Networks (CNN) have dominated the field of detection ever since the success of ...
Human-Object Interaction (HOI) detection is a core task for high-level image understanding. Recently...
Ranking hypothesis sets is a powerful concept for efficient object detection. In this work, we propo...
DETR-based object detectors have achieved remarkable performance but are sample-inefficient and exhi...
We present a strong object detector with encoder-decoder pretraining and finetuning. Our method, cal...