Towards Precise Weakly Supervised Object Detection via Interactive Contrastive Learning of Context Information

被引:0
|
作者
Lai, Qi [1 ]
Vong, Chi-Man [2 ]
Shi, Sai-Qi [3 ]
Chen, C. L. Philip [4 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] Univ Macau, Dept Comp & Informat Sci, Macau 999078, Peoples R China
[3] Univ Macau, Dept Elect & Comp Engn, Macau 999078, Peoples R China
[4] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
关键词
Context information; weakly supervised object detection; graph contrastive learning; interactive framework; NETWORK;
D O I
10.1109/TETCI.2024.3436853
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised object detection (WSOD) aims at learning precise object detectors with only image-level tags. In spite of intensive research on deep learning (DL) approaches over the past few years, there is still a significant performance gap between WSOD and fully supervised object detection. Existing WSOD methods only consider the visual appearance of each region proposal but ignore the useful context information in the image. This paper proposes an interactive end-to-end WSDO framework called JLWSOD with two innovations: i) two types of WSOD-specific context information (i.e., instance-wise correlation and semantic-wise correlation) are proposed and introduced into WSOD framework; ii) an interactive graph contrastive learning (iGCL) mechanism is designed to jointly optimize the visual appearance and context information for better WSOD performance. Specifically, the iGCL mechanism takes full advantage of the complementary interpretations of the WSOD, namely instance-wise detection and semantic-wise prediction tasks, forming a more comprehensive solution. Extensive experiments on the widely used PASCAL VOC and MS COCO benchmarks verify the superiority of JLWSOD over alternative SOTA and baseline models (improvement of 3.0%similar to 23.3% on mAP and 3.1%similar to 19.7% on CorLoc, respectively).
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Object Discovery via Contrastive Learning for Weakly Supervised Object Detection
    Seo, Jinhwan
    Bae, Wonho
    Sutherland, Danica J.
    Noh, Junhyug
    Kim, Daijin
    [J]. COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 312 - 329
  • [2] Negative Prototypes Guided Contrastive Learning for Weakly Supervised Object Detection
    Zhang, Yu
    Zhu, Chuang
    Yang, Guoqing
    Chen, Siqi
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 36 - 51
  • [3] Instance-Level Contrastive Learning for Weakly Supervised Object Detection
    Zhang, Ming
    Zeng, Bing
    [J]. SENSORS, 2022, 22 (19)
  • [4] An adaptive learning-based weakly supervised object detection via context awareness
    Zeng, Xiaoran
    Li, Zhenhua
    Zhang, Weidong
    [J]. 2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 331 - 335
  • [5] Weakly Supervised Region-Level Contrastive Learning for Efficient Object Detection
    Deng, Yuang
    Zhang, Yuhang
    Dai, Wenrui
    Zhang, Xiaopeng
    Xiong, Hongkai
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [6] Towards Precise End-to-end Weakly Supervised Object Detection Network
    Yang, Ke
    Li, Dongsheng
    Dou, Yong
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8371 - 8380
  • [7] Weakly Supervised Object Detection with Symmetry Context
    Gu, Xinyu
    Zhang, Qian
    Lu, Zheng
    [J]. SYMMETRY-BASEL, 2022, 14 (09):
  • [8] Weakly-Supervised Contrastive Learning for Unsupervised Object Discovery
    Lv, Yunqiu
    Zhang, Jing
    Barnes, Nick
    Dai, Yuchao
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2689 - 2702
  • [9] Weakly Supervised Contrastive Learning
    Zheng, Mingkai
    Wang, Fei
    You, Shan
    Qian, Chen
    Zhang, Changshui
    Wang, Xiaogang
    Xu, Chang
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10022 - 10031
  • [10] Weakly-Supervised Video Object Grounding via Stable Context Learning
    Wang, Wei
    Gao, Junyu
    Xu, Changsheng
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 760 - 768