Weakly Supervised Object Localization Based on Implicit Spatial Constraints

被引:0
|
作者
Li, Hanxin [1 ]
Jia, Ke [1 ]
Jin, Zhicheng [1 ]
Xu, Changyuan [1 ]
Zhou, Ji [1 ]
Wang, Wenrun [1 ]
机构
[1] Chengdu Univ Informat Technol, Chengdu 620225, Sichuan, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024 | 2024年 / 14867卷
关键词
Weak Supervision; Class Activation Map; Gaussian Mixture; Feature Fusion;
D O I
10.1007/978-981-97-5597-4_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly Supervised Object Localization (WSOL) tasks, as one of the most challenging tasks in the field of computer vision, aims to locate objects using only a small number of image-level labels, thus reducing annotation costs. The most popular paradigm in current weakly supervised object detection divides the WSOL task into two parts: class-agnostic object localization and object classification. After continuous optimization by subsequent scholars, models based on the Shallow Feature-aware Pseudo Label Supervised Object Localization (SPOL) paradigm have shown good performance. In this paper, we propose a spatial awareness attention module to construct implicit object spatial feature relationships in images, obtaining clear object boundaries as constraints, and then using a multidimensional convolutional attention mechanism to diffuse the target activation area. This method is designed to solve the phenomenon of class partial activation that existed in previous methods, because it can obtain clear object boundary information, which affects the activation of the overall target area. In addition, we use Gaussian mixture modeling for class-agnostic model segmentation to achieve precise object masks, which can overcome the negative impact of multiple objects and background noise on mask generation. Experiments verify that our model outperforms the baseline model on both the CUB-200-2011 and ImageNet-1K benchmarks, achieving 96.42% and 69.04% on the GT-known metric respectively (increases of 0.32% and 1.15%).
引用
收藏
页码:416 / 429
页数:14
相关论文
共 50 条
  • [41] Adversarial Complementary Learning for Weakly Supervised Object Localization
    Zhang, Xiaolin
    Wei, Yunchao
    Feng, Jiashi
    Yang, Yi
    Huang, Thomas
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1325 - 1334
  • [42] Generative Prompt Model for Weakly Supervised Object Localization
    Zhao, Yuzhong
    Ye, Qixiang
    Wu, Weijia
    Shen, Chunhua
    Wan, Fang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6328 - 6338
  • [43] Adaptive Zone Learning for Weakly Supervised Object Localization
    Chen, Zhiwei
    Wang, Siwei
    Cao, Liujuan
    Shen, Yunhang
    Ji, Rongrong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 14
  • [44] Module of Axis-based Nexus Attention for weakly supervised object localization
    Sohn, Junghyo
    Jeon, Eunjin
    Jung, Wonsik
    Kang, Eunsong
    Suk, Heung-Il
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [45] Region-based dropout with attention prior for weakly supervised object localization
    Choe, Junsuk
    Han, Dongyoon
    Yun, Sangdoo
    Ha, Jung-Woo
    Oh, Seong Joon
    Shim, Hyunjung
    PATTERN RECOGNITION, 2021, 116
  • [46] Module of Axis-based Nexus Attention for weakly supervised object localization
    Junghyo Sohn
    Eunjin Jeon
    Wonsik Jung
    Eunsong Kang
    Heung-Il Suk
    Scientific Reports, 13
  • [47] NAD: Neuron Activation based Divergence Maps for Weakly Supervised Object Localization
    Bagga, Siddhant
    Gupta, Sarthak
    Bhatia, Mohinder Pal Singh
    Dhurandher, Sanjay Kumar
    Goyal, Anish
    2020 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2020, : 60 - 64
  • [48] Dual-Gradients Localization Framework for Weakly Supervised Object Localization
    Tan, Chuangchuang
    Gu, Guanghua
    Ruan, Tao
    Wei, Shikui
    Zhao, Yao
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1976 - 1984
  • [49] Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization
    Kim, Eunji
    Kim, Siwon
    Lee, Jungbeom
    Kim, Hyunwoo
    Yoon, Sungroh
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14238 - 14247
  • [50] Spatial Structure Constraints for Weakly Supervised Semantic Segmentation
    Chen, Tao
    Yao, Yazhou
    Huang, Xingguo
    Li, Zechao
    Nie, Liqiang
    Tang, Jinhui
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1136 - 1148