Dense Distinct Query for End-to-End Object Detection

Cited by: 106
Authors
Zhang, Shilong [1, 3]
Wang, Xinjiang [2]
Wang, Jiaqi [1]
Pang, Jiangmiao [1]
Lyu, Chengqi [1]
Zhang, Wenwei [1, 4]
Luo, Ping [1, 3]
Chen, Kai [1]
Affiliations
[1] Shanghai AI Lab, Shanghai, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
[3] Univ Hong Kong, Hong Kong, Peoples R China
[4] Nanyang Technol Univ, S Lab, Singapore, Singapore
Funding
National Key R&D Program of China;
Keywords
DOI
10.1109/CVPR52729.2023.00708
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
One-to-one label assignment in object detection has successfully obviated the need for non-maximum suppression (NMS) as postprocessing and made the pipeline end-to-end. However, it triggers a new dilemma: the widely used sparse queries cannot guarantee high recall, while dense queries inevitably bring more similar queries and encounter optimization difficulties. As both sparse and dense queries are problematic, what are the expected queries in end-to-end object detection? This paper shows that the solution should be Dense Distinct Queries (DDQ). Concretely, we first lay dense queries like traditional detectors and then select distinct ones for one-to-one assignments. DDQ blends the advantages of traditional and recent end-to-end detectors and significantly improves the performance of various detectors, including FCN, R-CNN, and DETRs. Most impressively, DDQ-DETR achieves 52.1 AP on the MS-COCO dataset within 12 epochs using a ResNet-50 backbone, outperforming all existing detectors in the same setting. DDQ also shares the benefit of end-to-end detectors in crowded scenes and achieves 93.8 AP on CrowdHuman. We hope DDQ can inspire researchers to consider the complementarity between traditional methods and end-to-end detectors. The source code can be found at https://github.com/jshilong/DDQ.
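The "lay dense queries, then select distinct ones" step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how distinctness is measured, so this example assumes a greedy, class-agnostic IoU-based suppression over the boxes attached to each dense query. The function names (`iou`, `select_distinct_queries`) and the values `iou_thr=0.7` and `max_queries=300` are hypothetical choices made for illustration, not taken from the record or from the authors' repository.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def select_distinct_queries(
    boxes: List[Box],
    scores: List[float],
    iou_thr: float = 0.7,      # assumed threshold, illustrative only
    max_queries: int = 300,    # assumed cap, illustrative only
) -> List[int]:
    """Return indices of dense queries kept as 'distinct'.

    Queries are visited in descending score order; a query is kept only
    if its box overlaps every already-kept box by at most iou_thr.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    kept: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_thr for j in kept):
            kept.append(i)
            if len(kept) == max_queries:
                break
    return kept


# Three dense queries: the first two are near-duplicates of one object.
dense_boxes = [(0, 0, 10, 10), (0, 0, 10, 9.5), (20, 20, 30, 30)]
dense_scores = [0.9, 0.8, 0.7]
distinct = select_distinct_queries(dense_boxes, dense_scores)
```

In this sketch only the surviving indices would go on to one-to-one assignment, so near-duplicate queries no longer compete for the same ground-truth object.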
Pages: 7329 - 7338
Number of pages: 10
Related papers
50 in total
  • [1] RQFormer: Rotated Query Transformer for end-to-end oriented object detection
    Zhao, Jiaqi
    Ding, Zeyu
    Zhou, Yong
    Zhu, Hancheng
    Du, Wen-Liang
    Yao, Rui
    El Saddik, Abdulmotaleb
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 266
  • [2] End-to-End Object Detection with YOLOF
    Xi, Xing
    Huang, Yangyang
    Wu, Weiye
    Luo, Ronghua
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VII, ICIC 2024, 2024, 14868 : 101 - 112
  • [3] Enhanced Sparse Detection for End-to-End Object Detection
    Liao, Yongwei
    Chen, Gang
    Xu, Runnan
    IEEE ACCESS, 2022, 10 : 85630 - 85640
  • [4] EOOD: End-to-end oriented object detection
    Zhang, Caiguang
    Chen, Zilong
    Xiong, Boli
    Ji, Kefeng
    Kuang, Gangyao
    NEUROCOMPUTING, 2025, 621
  • [5] Intrinsic Explainability for End-to-End Object Detection
    Fernandes, Luis
    Fernandes, Joao N. D.
    Calado, Mariana
    Pinto, Joao Ribeiro
    Cerqueira, Ricardo
    Cardoso, Jaime S.
    IEEE ACCESS, 2024, 12 : 2623 - 2634
  • [6] What Makes for End-to-End Object Detection?
    Sun, Peize
    Jiang, Yi
    Xie, Enze
    Shao, Wenqi
    Yuan, Zehuan
    Wang, Changhu
    Luo, Ping
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [7] DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection
    Fang, Hao-Shu
    Xie, Yichen
    Shao, Dian
    Lu, Cewu
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1291 - 1299
  • [8] Deep interactive query design and progressive search for end-to-end detection of tiny object in aerial images
    Jin, Chuan
    Zheng, Anqi
    Wu, Zhaoying
    Tong, Changqing
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025,
  • [9] End-to-End Object Detection with Fully Convolutional Network
    Wang, Jianfeng
    Song, Lin
    Li, Zeming
    Sun, Hongbin
    Sun, Jian
    Zheng, Nanning
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021 : 15844 - 15853
  • [10] SRDD: a lightweight end-to-end object detection with transformer
    Zhu, Yuan
    Xia, Qingyuan
    Jin, Wen
    CONNECTION SCIENCE, 2022, 34 (01) : 2448 - 2465