Weakly Supervised Object Localization Based on Implicit Spatial Constraints

被引:0
|
作者
Li, Hanxin [1 ]
Jia, Ke [1 ]
Jin, Zhicheng [1 ]
Xu, Changyuan [1 ]
Zhou, Ji [1 ]
Wang, Wenrun [1 ]
机构
[1] Chengdu Univ Informat Technol, Chengdu 620225, Sichuan, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024 | 2024年 / 14867卷
关键词
Weak Supervision; Class Activation Map; Gaussian Mixture; Feature Fusion;
D O I
10.1007/978-981-97-5597-4_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly Supervised Object Localization (WSOL) tasks, as one of the most challenging tasks in the field of computer vision, aims to locate objects using only a small number of image-level labels, thus reducing annotation costs. The most popular paradigm in current weakly supervised object detection divides the WSOL task into two parts: class-agnostic object localization and object classification. After continuous optimization by subsequent scholars, models based on the Shallow Feature-aware Pseudo Label Supervised Object Localization (SPOL) paradigm have shown good performance. In this paper, we propose a spatial awareness attention module to construct implicit object spatial feature relationships in images, obtaining clear object boundaries as constraints, and then using a multidimensional convolutional attention mechanism to diffuse the target activation area. This method is designed to solve the phenomenon of class partial activation that existed in previous methods, because it can obtain clear object boundary information, which affects the activation of the overall target area. In addition, we use Gaussian mixture modeling for class-agnostic model segmentation to achieve precise object masks, which can overcome the negative impact of multiple objects and background noise on mask generation. Experiments verify that our model outperforms the baseline model on both the CUB-200-2011 and ImageNet-1K benchmarks, achieving 96.42% and 69.04% on the GT-known metric respectively (increases of 0.32% and 1.15%).
引用
收藏
页码:416 / 429
页数:14
相关论文
共 50 条
  • [1] Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration
    Bai, Haotian
    Zhang, Ruimao
    Wang, Jiong
    Wan, Xiang
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 612 - 628
  • [2] Spatial-Aware Token for Weakly Supervised Object Localization
    Wu, Pingyu
    Zhai, Wei
    Cao, Yang
    Luo, Jiebo
    Zha, Zheng-Jun
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1844 - 1854
  • [3] Rethinking the Localization in Weakly Supervised Object Localization
    Xu, Rui
    Luo, Yong
    Hu, Han
    Du, Bo
    Shen, Jialie
    Wen, Yonggang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5484 - 5494
  • [4] Video-based Object Recognition with Weakly Supervised Object Localization
    Liu, Yang
    Kouskouridas, Rigas
    Kim, Tae-Kyun
    Proceedings 3rd IAPR Asian Conference on Pattern Recognition ACPR 2015, 2015, : 46 - 50
  • [5] Generalized Weakly Supervised Object Localization
    Zhang, Dingwen
    Guo, Guangyu
    Zeng, Wenyuan
    Li, Lei
    Han, Junwei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5395 - 5406
  • [6] Weakly Supervised Learning for Object Localization Based on an Attention Mechanism
    Park, Nojin
    Ko, Hanseok
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [7] Weakly supervised object localization via foreground generation with foreground-background constraints
    Li, Bingfeng
    Shi, Erdong
    Ruan, Haohao
    Jiang, Zhanshuo
    Li, Xinwei
    Wang, Keping
    Wang, Shuai
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 269
  • [8] Weakly supervised object localization and segmentation in videos
    Rochan, Mrigank
    Rahman, Shafin
    Bruce, Neil D. B.
    Wang, Yang
    IMAGE AND VISION COMPUTING, 2016, 56 : 1 - 12
  • [9] Weakly Supervised Object Localization as Domain Adaption
    Zhu, Lei
    She, Qi
    Chen, Qian
    You, Yunfei
    Wang, Boyu
    Lu, Yanye
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14617 - 14626
  • [10] Weakly Supervised Object Localization with Stable Segmentations
    Galleguillos, Carolina
    Babenko, Boris
    Rabinovich, Andrew
    Belongie, Serge
    COMPUTER VISION - ECCV 2008, PT I, PROCEEDINGS, 2008, 5302 : 193 - 207