Semi-DETR: Semi-Supervised Object Detection with Detection Transformers

被引:10
|
作者
Zhang, Jiacheng [1 ,2 ]
Lin, Xiangru [2 ]
Zhang, Wei [2 ]
Wang, Kuo [1 ]
Tan, Xiao [2 ]
Han, Junyu [2 ]
Ding, Errui [2 ]
Wang, Jingdong [2 ]
Li, Guanbin [1 ,3 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[2] Baidu Inc, Dept Comp Vis Technol VIS, Beijing, Peoples R China
[3] Sun Yat Sen Univ, Res Inst, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.02280
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We analyze the DETR-based framework on semi-supervised object detection (SSOD) and observe that (1) the one-to-one assignment strategy generates incorrect matching when the pseudo ground-truth bounding box is inaccurate, leading to training inefficiency; (2) DETR-based detectors lack deterministic correspondence between the input query and its prediction output, which hinders the applicability of the consistency-based regularization widely used in current SSOD methods. We present Semi-DETR, the first transformer-based end-to-end semi-supervised object detector, to tackle these problems. Specifically, we propose a Stage-wise Hybrid Matching strategy that combines the one-to-many assignment and one-to-one assignment strategies to improve the training efficiency of the first stage and thus provide high-quality pseudo labels for the training of the second stage. Besides, we introduce a Cross-view Query Consistency method to learn the semantic feature invariance of object queries from different views while avoiding the need to find deterministic query correspondence. Furthermore, we propose a Cost-based Pseudo Label Mining module to dynamically mine more pseudo boxes based on the matching cost of pseudo ground truth bounding boxes for consistency training. Extensive experiments on all SSOD settings of both COCO and Pascal VOC benchmark datasets show that our Semi-DETR method outperforms all state-of-the-art methods by clear margins.
引用
收藏
页码:23809 / 23818
页数:10
相关论文
共 50 条
  • [21] SOOD: Towards Semi-Supervised Oriented Object Detection
    Hua, Wei
    Liang, Dingkang
    Li, Jingyu
    Liu, Xiaolong
    Zou, Zhikang
    Ye, Xiaoqing
    Bai, Xiang
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15558 - 15567
  • [22] Semi-Supervised Novelty Detection
    Blanchard, Gilles
    Lee, Gyemin
    Scott, Clayton
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 2973 - 3009
  • [23] Points as Queries: Weakly Semi-supervised Object Detection by Points
    Chen, Liangyu
    Yang, Tong
    Zhang, Xiangyu
    Zhang, Wei
    Sun, Jian
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8819 - 8828
  • [24] Semi-supervised Learning of Feature Hierarchies for Object Detection in a Video
    Yang, Yang
    Shu, Guang
    Shah, Mubarak
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1650 - 1657
  • [25] SEMI-SUPERVISED OBJECT DETECTION FOR SORGHUM PANICLES IN UAV IMAGERY
    Cai, Enyu
    Guo, Jiaqi
    Yang, Changye
    Delp, Edward J.
    [J]. IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6482 - 6485
  • [26] Semi-supervised self-training of object detection models
    Rosenberg, C
    Hebert, M
    Schneiderman, H
    [J]. WACV 2005: SEVENTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2005, : 29 - 36
  • [27] Semi-Supervised and Long-Tailed Object Detection with CascadeMatch
    Zang, Yuhang
    Zhou, Kaiyang
    Huang, Chen
    Loy, Chen Change
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (04) : 987 - 1001
  • [28] Toward Semi-Supervised Graphical Object Detection in Document Images
    Kallempudi, Goutham
    Hashmi, Khurram Azeem
    Pagani, Alain
    Liwicki, Marcus
    Stricker, Didier
    Afzal, Muhammad Zeshan
    [J]. FUTURE INTERNET, 2022, 14 (06)
  • [29] Semi-Supervised Exemplar Learning for Object Detection in Aerial Imagery
    Overbey, Lucas A.
    Lyle, Jamie
    Pan, Jean
    Holt, Branson
    Jaegar, Alan
    Jaeger, Ryan
    van Epps, Todd
    Ruane, Martin
    [J]. GEOSPATIAL INFORMATICS XI, 2021, 11733
  • [30] Interpolation-based Semi-supervised Learning for Object Detection
    Jeong, Jisoo
    Verma, Vikas
    Hyun, Minsung
    Kannala, Juho
    Kwak, Nojun
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11597 - 11606