Weakly Supervised Object Localization Based on Implicit Spatial Constraints

被引:0
|
作者
Li, Hanxin [1 ]
Jia, Ke [1 ]
Jin, Zhicheng [1 ]
Xu, Changyuan [1 ]
Zhou, Ji [1 ]
Wang, Wenrun [1 ]
机构
[1] Chengdu Univ Informat Technol, Chengdu 620225, Sichuan, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024 | 2024年 / 14867卷
关键词
Weak Supervision; Class Activation Map; Gaussian Mixture; Feature Fusion;
D O I
10.1007/978-981-97-5597-4_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly Supervised Object Localization (WSOL) tasks, as one of the most challenging tasks in the field of computer vision, aims to locate objects using only a small number of image-level labels, thus reducing annotation costs. The most popular paradigm in current weakly supervised object detection divides the WSOL task into two parts: class-agnostic object localization and object classification. After continuous optimization by subsequent scholars, models based on the Shallow Feature-aware Pseudo Label Supervised Object Localization (SPOL) paradigm have shown good performance. In this paper, we propose a spatial awareness attention module to construct implicit object spatial feature relationships in images, obtaining clear object boundaries as constraints, and then using a multidimensional convolutional attention mechanism to diffuse the target activation area. This method is designed to solve the phenomenon of class partial activation that existed in previous methods, because it can obtain clear object boundary information, which affects the activation of the overall target area. In addition, we use Gaussian mixture modeling for class-agnostic model segmentation to achieve precise object masks, which can overcome the negative impact of multiple objects and background noise on mask generation. Experiments verify that our model outperforms the baseline model on both the CUB-200-2011 and ImageNet-1K benchmarks, achieving 96.42% and 69.04% on the GT-known metric respectively (increases of 0.32% and 1.15%).
引用
收藏
页码:416 / 429
页数:14
相关论文
共 50 条
  • [31] Feature disparity learning for weakly supervised object localization
    Li, Bingfeng
    Ruan, Haohao
    Li, Xinwei
    Wang, Keping
    IMAGE AND VISION COMPUTING, 2024, 145
  • [32] Weakly Supervised Object Localization Using Size Estimates
    Shi, Miaojing
    Ferrari, Vittorio
    COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 105 - 121
  • [33] DANet: Divergent Activation for Weakly Supervised Object Localization
    Xue, Haolan
    Liu, Chang
    Wan, Fang
    Jiao, Jianbin
    Ji, Xiangyang
    Ye, Qixiang
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6588 - 6597
  • [34] ViTOL: Vision Transformer for Weakly Supervised Object Localization
    Gupta, Saurav
    Lakhotia, Sourav
    Rawat, Abhay
    Tallamraju, Rahul
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4100 - 4109
  • [35] Shallow Feature Matters for Weakly Supervised Object Localization
    Wei, Jun
    Wang, Qin
    Li, Zhen
    Wang, Sheng
    Zhou, S. Kevin
    Cui, Shuguang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5989 - 5997
  • [36] Hierarchical complementary learning for weakly supervised object localization
    Benassou, Sabrina Narimene
    Shi, Wuzhen
    Jiang, Feng
    Benzine, Abdallah
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 100
  • [37] Strengthen Learning Tolerance for Weakly Supervised Object Localization
    Guo, Guangyu
    Han, Junwei
    Wan, Fang
    Zhang, Dingwen
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7399 - 7408
  • [38] Weakly Supervised Object Localization with Progressive Domain Adaptation
    Su, Shuochen
    Heide, Felix
    Swanson, Robin
    Klein, Jonathan
    Callenberg, Clara
    Hullin, Matthias
    Heidrich, Wolfgang
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : CP40 - CP40
  • [39] Background Activation Suppression for Weakly Supervised Object Localization
    Wu, Pingyu
    Zhai, Wei
    Cao, Yang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14228 - 14237
  • [40] Soft Proposal Networks for Weakly Supervised Object Localization
    Zhu, Yi
    Zhou, Yanzhao
    Ye, Qixiang
    Qiu, Qiang
    Jiao, Jianbin
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1859 - 1868