Semi-DETR: Semi-Supervised Object Detection with Detection Transformers

被引:10
|
作者
Zhang, Jiacheng [1 ,2 ]
Lin, Xiangru [2 ]
Zhang, Wei [2 ]
Wang, Kuo [1 ]
Tan, Xiao [2 ]
Han, Junyu [2 ]
Ding, Errui [2 ]
Wang, Jingdong [2 ]
Li, Guanbin [1 ,3 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[2] Baidu Inc, Dept Comp Vis Technol VIS, Beijing, Peoples R China
[3] Sun Yat Sen Univ, Res Inst, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.02280
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We analyze the DETR-based framework on semi-supervised object detection (SSOD) and observe that (1) the one-to-one assignment strategy generates incorrect matching when the pseudo ground-truth bounding box is inaccurate, leading to training inefficiency; (2) DETR-based detectors lack deterministic correspondence between the input query and its prediction output, which hinders the applicability of the consistency-based regularization widely used in current SSOD methods. We present Semi-DETR, the first transformer-based end-to-end semi-supervised object detector, to tackle these problems. Specifically, we propose a Stage-wise Hybrid Matching strategy that combines the one-to-many assignment and one-to-one assignment strategies to improve the training efficiency of the first stage and thus provide high-quality pseudo labels for the training of the second stage. Besides, we introduce a Cross-view Query Consistency method to learn the semantic feature invariance of object queries from different views while avoiding the need to find deterministic query correspondence. Furthermore, we propose a Cost-based Pseudo Label Mining module to dynamically mine more pseudo boxes based on the matching cost of pseudo ground truth bounding boxes for consistency training. Extensive experiments on all SSOD settings of both COCO and Pascal VOC benchmark datasets show that our Semi-DETR method outperforms all state-of-the-art methods by clear margins.
引用
收藏
页码:23809 / 23818
页数:10
相关论文
共 50 条
  • [1] Semi-supervised Object Detection with Unlabeled Data
    Nhu-Van Nguyen
    Rigaud, Christophe
    Burie, Jean-Christophe
    [J]. PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 289 - 296
  • [2] Semi-Supervised Active Learning for Object Detection
    Chen, Sijin
    Yang, Yingyun
    Hua, Yan
    [J]. ELECTRONICS, 2023, 12 (02)
  • [3] Proposal Learning for Semi-Supervised Object Detection
    Tang, Peng
    Ramaiah, Chetan
    Wang, Yan
    Xu, Ran
    Xiong, Caiming
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2290 - 2300
  • [4] Active Teacher for Semi-Supervised Object Detection
    Mi, Peng
    Lin, Jianghang
    Zhou, Yiyi
    Shen, Yunhang
    Luo, Gen
    Sun, Xiaoshuai
    Cao, Liujuan
    Fu, Rongrong
    Xu, Qiang
    Ji, Rongrong
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14462 - 14471
  • [5] Semi-supervised Active Salient Object Detection
    Lv, Yunqiu
    Liu, Bowen
    Zhang, Jing
    Dai, Yuchao
    Li, Aixuan
    Zhang, Tong
    [J]. PATTERN RECOGNITION, 2022, 123
  • [6] Improving Localization for Semi-Supervised Object Detection
    Rossi, Leonardo
    Karimi, Akbar
    Prati, Andrea
    [J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II, 2022, 13232 : 516 - 527
  • [7] Label Matching Semi-Supervised Object Detection
    Chen, Binbin
    Chen, Weijie
    Yang, Shicai
    Xuan, Yunyi
    Song, Jie
    Xie, Di
    Pu, Shiliang
    Song, Mingli
    Zhuang, Yueting
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14361 - 14370
  • [8] Semi-supervised Open-World Object Detection
    Mullappilly, Sahal Shaji
    Gehlot, Abhishek Singh
    Anwer, Rao Muhammad
    Khan, Fahad Shahbaz
    Cholakkal, Hisham
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4305 - 4314
  • [9] Uncertain region mining semi-supervised object detection
    Tianxiang Yin
    Ningzhong Liu
    Han Sun
    [J]. Applied Intelligence, 2024, 54 : 2300 - 2313
  • [10] Semi-Conv-DETR: A railway ballast bed defect detection model integrating convolutional augmentation and semi-supervised DETR
    Teng, Jing
    Long, Xujie
    Yang, Qi
    Jing, Guoqing
    Liu, Huan
    [J]. TRANSPORTATION GEOTECHNICS, 2024, 48