Dual-branch contrastive learning for weakly supervised object localization

被引:0
|
作者
Guo, Zebin [1 ,2 ]
Li, Dong [1 ,2 ]
Du, Zhengjun [1 ,2 ]
Seng, Bingfeng [1 ,2 ]
机构
[1] Qinghai Univ, Sch Comp Technol & Applicat, Xining 810000, Peoples R China
[2] Intelligent Comp & Applicat Lab Qinghai Prov, Xining, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Computer vision; Weakly supervised object localization; Dual-branch network; Contrastive learning;
D O I
10.1007/s10489-025-06514-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The weakly supervised object localization task uses image-level labels to train object localization models. Traditional convolutional neural network (CNN)-based methods usually localize objects using a class activation map. However, the class activation map usually suffers from the problem of activating a small part of the object that is most discriminative. Meanwhile, the methods based on the Vision Transformer can capture long-range feature dependencies but tend to ignore local feature details. In this paper, we innovatively propose a dual-branch contrastive learning (DBC) method that consists of a Transformer and a CNN branch. The method can effectively separate the background and foreground of an image and fuse the features of Transformer and CNN through contrastive learning. Specifically, the method separates the background and foreground representations of the image using the initially generated class-agnostic activation maps. Then, the representations of the same image from different branches form positive pairs for contrastive learning. The background and foreground representations from the same branch form negative pairs. Finally, the DBC method forces the model to separate the background and foreground representations through negative contrastive loss and makes the model fuse the features of two branches through positive contrastive loss. Experiments on the ILSVRC benchmark show that the proposed method can achieve a Top-1 localization accuracy of 59.9% and a GT-known localization accuracy of 71.7%, which are better metrics than those of the state-of-the-art methods with the same parameter complexity.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] A dual-branch weakly supervised learning based network for accurate mapping of woody vegetation from remote sensing images
    Cheng, Youwei
    Lan, Shaocheng
    Fan, Xijian
    Tjahjadi, Tardi
    Jin, Shichao
    Cao, Lin
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 124
  • [22] Robust Localization-Guided Dual-Branch Network for Camouflaged Object Segmentation
    Wang, Chuanjiang
    Li, Yuepeng
    Wei, Guohui
    Hou, Xiankai
    Sun, Xiujuan
    ELECTRONICS, 2024, 13 (05)
  • [23] Two-Phase Learning for Weakly Supervised Object Localization
    Kim, Dahun
    Cho, Donghyeon
    Yoo, Donggeun
    Kweon, In So
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3554 - 3563
  • [24] Weakly Supervised Learning for Object Localization Based on an Attention Mechanism
    Park, Nojin
    Ko, Hanseok
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [25] Weakly Supervised Region-Level Contrastive Learning for Efficient Object Detection
    Deng, Yuang
    Zhang, Yuhang
    Dai, Wenrui
    Zhang, Xiaopeng
    Xiong, Hongkai
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [26] Rethinking the Localization in Weakly Supervised Object Localization
    Xu, Rui
    Luo, Yong
    Hu, Han
    Du, Bo
    Shen, Jialie
    Wen, Yonggang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5484 - 5494
  • [27] Dual-Branch Discriminative Transmission Line Bolt Image Classification Based on Contrastive Learning
    Ji, Yan-Peng
    Zhao, Jian-Li
    Liu, Liang-Shuai
    Feng, Hai-Yan
    Du, Jia-Qi
    Fang, Xia
    PROCESSES, 2025, 13 (03)
  • [28] Self-Supervised Learning With a Dual-Branch ResNet for Hyperspectral Image Classification
    Li, Tianrui
    Zhang, Xiaohua
    Zhang, Shuhan
    Wang, Li
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [29] CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning
    Zhang, Can
    Cao, Meng
    Yang, Dongming
    Chen, Jie
    Zou, Yuexian
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16005 - 16014
  • [30] Generalized Weakly Supervised Object Localization
    Zhang, Dingwen
    Guo, Guangyu
    Zeng, Wenyuan
    Li, Lei
    Han, Junwei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5395 - 5406