Strengthen Learning Tolerance for Weakly Supervised Object Localization

被引:34
|
作者
Guo, Guangyu [1 ]
Han, Junwei [1 ]
Wan, Fang [2 ]
Zhang, Dingwen [1 ]
机构
[1] Northwestern Polytech Univ, Brain & Artificial Intelligence Lab, Xian, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR46437.2021.00732
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised object localization (WSOL) aims at learning to localize objects of interest by only using the image-level labels as the supervision. While numerous efforts have been made in this field, recent approaches still suffer from two challenges: one is the part domination issue while the other is the learning robustness issue. Specifically, the former makes the localizer prone to the local discriminative object regions rather than the desired whole object, and the latter makes the localizer over-sensitive to the variations of the input images so that one can hardly obtain localization results robust to the arbitrary visual stimulus. To solve these issues, we propose a novel framework to strengthen the learning tolerance, referred to as SLT-Net, for WSOL. Specifically, we consider two fold learning tolerance strengthening mechanisms. One is the semantic tolerance strengthening mechanism, which allows the localizer to make mistakes for classifying similar semantics so that it will not concentrate too much on the discriminative local regions. The other is the visual stimuli tolerance strengthening mechanism, which enforces the localizer to be robust to different image transformations so that the prediction quality will not be sensitive to each specific input image. Finally, we implement comprehensive experimental comparisons on two widely-used datasets CUB and ILSVRC2012, which demonstrate the effectiveness of our proposed approach.
引用
收藏
页码:7399 / 7408
页数:10
相关论文
共 50 条
  • [1] Weakly Supervised Object Localization with Latent Category Learning
    Wang, Chong
    Ren, Weiqiang
    Huang, Kaiqi
    Tan, Tieniu
    [J]. COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 431 - 445
  • [2] Feature disparity learning for weakly supervised object localization
    Li, Bingfeng
    Ruan, Haohao
    Li, Xinwei
    Wang, Keping
    [J]. IMAGE AND VISION COMPUTING, 2024, 145
  • [3] Hierarchical complementary learning for weakly supervised object localization
    Benassou, Sabrina Narimene
    Shi, Wuzhen
    Jiang, Feng
    Benzine, Abdallah
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 100
  • [4] Adversarial Complementary Learning for Weakly Supervised Object Localization
    Zhang, Xiaolin
    Wei, Yunchao
    Feng, Jiashi
    Yang, Yi
    Huang, Thomas
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1325 - 1334
  • [5] Adaptive Zone Learning for Weakly Supervised Object Localization
    Chen, Zhiwei
    Wang, Siwei
    Cao, Liujuan
    Shen, Yunhang
    Ji, Rongrong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [6] Two-Phase Learning for Weakly Supervised Object Localization
    Kim, Dahun
    Cho, Donghyeon
    Yoo, Donggeun
    Kweon, In So
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3554 - 3563
  • [7] Weakly Supervised Learning for Object Localization Based on an Attention Mechanism
    Park, Nojin
    Ko, Hanseok
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [8] Rethinking the Localization in Weakly Supervised Object Localization
    Xu, Rui
    Luo, Yong
    Hu, Han
    Du, Bo
    Shen, Jialie
    Wen, Yonggang
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5484 - 5494
  • [9] Generalized Weakly Supervised Object Localization
    Zhang, Dingwen
    Guo, Guangyu
    Zeng, Wenyuan
    Li, Lei
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5395 - 5406
  • [10] Deep Self-Taught Learning for Weakly Supervised Object Localization
    Jie, Zequn
    Wei, Yunchao
    Jin, Xiaojie
    Feng, Jiashi
    Liu, Wei
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4294 - 4302