Dual-branch contrastive learning for weakly supervised object localization

Authors
Guo, Zebin [1, 2]
Li, Dong [1, 2]
Du, Zhengjun [1, 2]
Seng, Bingfeng [1, 2]
Affiliations
[1] Qinghai Univ, Sch Comp Technol & Applicat, Xining 810000, Peoples R China
[2] Intelligent Comp & Applicat Lab Qinghai Prov, Xining, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Deep learning; Computer vision; Weakly supervised object localization; Dual-branch network; Contrastive learning;
DOI
10.1007/s10489-025-06514-1
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Weakly supervised object localization trains object localization models using only image-level labels. Traditional convolutional neural network (CNN)-based methods usually localize objects with a class activation map, which tends to activate only the small, most discriminative part of the object. Methods based on the Vision Transformer, by contrast, capture long-range feature dependencies but tend to ignore local feature details. In this paper, we propose a dual-branch contrastive learning (DBC) method that consists of a Transformer branch and a CNN branch. The method separates the background and foreground of an image and fuses the Transformer and CNN features through contrastive learning. Specifically, it first separates the background and foreground representations of the image using the initially generated class-agnostic activation maps. Representations of the same image from different branches then form positive pairs for contrastive learning, while the background and foreground representations from the same branch form negative pairs. Finally, a negative contrastive loss forces the model to separate the background and foreground representations, and a positive contrastive loss makes the model fuse the features of the two branches. Experiments on the ILSVRC benchmark show that the proposed method achieves a Top-1 localization accuracy of 59.9% and a GT-known localization accuracy of 71.7%, better than state-of-the-art methods with the same parameter complexity.
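The pairing scheme described in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not give the loss formulas, so everything concrete here is an assumption made for illustration: masked average pooling to obtain the foreground and background representations, cosine similarity as the similarity measure, the specific loss forms, and all function names (`split_representations`, `positive_loss`, `negative_loss`, `dbc_loss_sketch`). It also assumes both branches produce feature maps with the same number of channels. It should not be read as the authors' implementation.

```python
import numpy as np

EPS = 1e-8  # guards against division by zero


def split_representations(features, activation_map):
    """Masked average pooling (an assumed pooling choice).

    features: array of shape (C, H, W)
    activation_map: class-agnostic map of shape (H, W), values in [0, 1]
    Returns (foreground, background) vectors of shape (C,).
    """
    fg_weights = activation_map
    bg_weights = 1.0 - activation_map
    fg = (features * fg_weights).sum(axis=(1, 2)) / (fg_weights.sum() + EPS)
    bg = (features * bg_weights).sum(axis=(1, 2)) / (bg_weights.sum() + EPS)
    return fg, bg


def cosine(a, b):
    """Cosine similarity between two 1-D vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + EPS))


def positive_loss(rep_a, rep_b):
    """Assumed form: pull same-image representations from the two branches together."""
    return 1.0 - cosine(rep_a, rep_b)


def negative_loss(fg, bg):
    """Assumed form: penalize positive similarity between foreground and background."""
    return max(0.0, cosine(fg, bg))


def dbc_loss_sketch(feat_cnn, map_cnn, feat_trans, map_trans):
    """Return (positive, negative) loss terms for one image."""
    fg_c, bg_c = split_representations(feat_cnn, map_cnn)
    fg_t, bg_t = split_representations(feat_trans, map_trans)
    # Positive pairs: same image, different branches.
    pos = positive_loss(fg_c, fg_t) + positive_loss(bg_c, bg_t)
    # Negative pairs: foreground vs. background within the same branch.
    neg = negative_loss(fg_c, bg_c) + negative_loss(fg_t, bg_t)
    return pos, neg
```

In a real training setup the two terms would be weighted and added to the classification loss; the weighting is not stated in the abstract and is left out here.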
Pages: 16
Related papers
50 in total
  • [1] Dual-branch contrastive learning for weakly supervised object localization
    Zebin Guo
    Dong Li
    Zhengjun Du
    Bingfeng Seng
    Applied Intelligence, 2025, 55 (7)
  • [2] Contrastive and consistent feature learning for weakly supervised object localization and semantic segmentation
    Ki, Minsong
    Uh, Youngjung
    Lee, Wonyoung
    Byun, Hyeran
    NEUROCOMPUTING, 2021, 445 : 244 - 254
  • [3] Abnormal Fastener Recognition via Dual-Branch Supervised Contrastive Learning Network With Hard Feature Synthesis
    Wang, Jianzhu
    Wu, Jianqing
    Wang, Shengchun
    Zhao, Xinxin
    Li, Qingyong
    IEEE SENSORS JOURNAL, 2024, 24 (18) : 29365 - 29376
  • [4] Object Discovery via Contrastive Learning for Weakly Supervised Object Detection
    Seo, Jinhwan
    Bae, Wonho
    Sutherland, Danica J.
    Noh, Junhyug
    Kim, Daijin
    COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 312 - 329
  • [5] A Two-Branch Network for Weakly Supervised Object Localization
    Sun, Chang
    Ai, Yibo
    Wang, Sheng
    Zhang, Weidong
    ELECTRONICS, 2020, 9 (06) : 1 - 15
  • [6] Weakly-Supervised Contrastive Learning for Unsupervised Object Discovery
    Lv, Yunqiu
    Zhang, Jing
    Barnes, Nick
    Dai, Yuchao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2689 - 2702
  • [7] Weakly Supervised Temporal Action Localization Based on Contrastive Learning
    Hou Y.
    Li Y.
    Guo Z.
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2023, 56 (01) : 73 - 80
  • [8] Weakly Supervised Contrastive Learning
    Zheng, Mingkai
    Wang, Fei
    You, Shan
    Qian, Chen
    Zhang, Changshui
    Wang, Xiaogang
    Xu, Chang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021 : 10022 - 10031
  • [9] Dual-Gradients Localization Framework for Weakly Supervised Object Localization
    Tan, Chuangchuang
    Gu, Guanghua
    Ruan, Tao
    Wei, Shikui
    Zhao, Yao
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020 : 1976 - 1984
  • [10] A dual-branch joint learning network for underwater object detection
    Wang, Bowen
    Wang, Zhi
    Guo, Wenhui
    Wang, Yanjiang
    KNOWLEDGE-BASED SYSTEMS, 2024, 293