Weakly Supervised Object Localization Based on Implicit Spatial Constraints

Times cited: 0
Authors
Li, Hanxin [1 ]
Jia, Ke [1 ]
Jin, Zhicheng [1 ]
Xu, Changyuan [1 ]
Zhou, Ji [1 ]
Wang, Wenrun [1 ]
Affiliations
[1] Chengdu Univ Informat Technol, Chengdu 620225, Sichuan, Peoples R China
Source
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024 | 2024 / Vol. 14867
Keywords
Weak Supervision; Class Activation Map; Gaussian Mixture; Feature Fusion
DOI
10.1007/978-981-97-5597-4_35
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Weakly Supervised Object Localization (WSOL), one of the most challenging tasks in computer vision, aims to locate objects using only a small number of image-level labels, thus reducing annotation costs. The most popular paradigm in current weakly supervised object detection divides the WSOL task into two parts: class-agnostic object localization and object classification. After continued refinement in later work, models based on the Shallow Feature-aware Pseudo Label Supervised Object Localization (SPOL) paradigm have shown good performance. In this paper, we propose a spatial awareness attention module that constructs implicit spatial feature relationships among objects in an image, obtains clear object boundaries as constraints, and then uses a multidimensional convolutional attention mechanism to diffuse the target activation area. The method is designed to address the partial class activation observed in previous methods: because it obtains clear object boundary information, it can influence the activation of the overall target area. In addition, we use Gaussian mixture modeling for class-agnostic model segmentation to obtain precise object masks, which overcomes the negative impact of multiple objects and background noise on mask generation. Experiments show that our model outperforms the baseline model on both the CUB-200-2011 and ImageNet-1K benchmarks, achieving 96.42% and 69.04% on the GT-known metric, respectively (increases of 0.32% and 1.15%).
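The abstract reports results on the GT-known metric without defining it. As commonly used in the WSOL literature, GT-known localization accuracy counts a prediction as correct when the predicted box overlaps the ground-truth box with an intersection over union (IoU) of at least 0.5, with the class label assumed to be given. The sketch below illustrates that common definition only; it is not the authors' evaluation code. The function names, the 0.5 threshold, and the one-ground-truth-box-per-image simplification are assumptions made for illustration.

```python
from typing import Sequence, Tuple

# A box is (x_min, y_min, x_max, y_max) in pixel coordinates (assumed layout).
Box = Tuple[float, float, float, float]


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def gt_known_accuracy(
    pred_boxes: Sequence[Box],
    gt_boxes: Sequence[Box],
    iou_threshold: float = 0.5,  # assumed: the conventional WSOL threshold
) -> float:
    """Fraction of images whose predicted box reaches the IoU threshold.

    Assumes exactly one predicted box and one ground-truth box per image.
    """
    if len(pred_boxes) != len(gt_boxes):
        raise ValueError("pred_boxes and gt_boxes must have the same length")
    if not pred_boxes:
        return 0.0
    hits = sum(
        box_iou(p, g) >= iou_threshold for p, g in zip(pred_boxes, gt_boxes)
    )
    return hits / len(pred_boxes)
```

Under this definition, a reported score such as 96.42% would mean that roughly 96 of every 100 test images have a predicted box reaching the IoU threshold against the ground truth.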
Pages: 416-429
Page count: 14