Contrastive and consistent feature learning for weakly supervised object localization and semantic segmentation

Cited by: 4
Authors
Ki, Minsong [1]
Uh, Youngjung [2]
Lee, Wonyoung [3]
Byun, Hyeran [1, 3]
Affiliations
[1] Yonsei Univ, Dept Comp Sci, Seoul, South Korea
[2] Yonsei Univ, Dept Appl Informat Engn, Seoul, South Korea
[3] Yonsei Univ, Grad Sch Artificial Intelligence, Seoul, South Korea
Funding
National Research Foundation, Singapore
Keywords
Weakly supervised learning; Localization; Segmentation; Contrastive learning; Foreground consistency;
DOI
10.1016/j.neucom.2021.03.023
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Weakly supervised learning attempts to construct predictive models by learning with weak supervision. In this paper, we concentrate on weakly supervised object localization and semantic segmentation tasks. Existing methods are limited to focusing on narrow discriminative parts or overextending the activations to less discriminative regions even on backgrounds. To mitigate these problems, we regard the background as an important cue that guides the feature activation to cover the entire object to the right extent, and propose two novel objective functions: 1) contrastive attention loss and 2) foreground consistency loss. Contrastive attention loss draws the foreground feature and its dropped version close together and pushes the dropped foreground feature away from the background feature. Foreground consistency loss favors agreement between layers and provides early layers with a sense of objectness. Using both losses leads to balanced improvements over localization and segmentation accuracy by boosting activations on less discriminative regions but restraining the activation in the target object extent. For better optimizing the above losses, we use the non-local attention blocks to replace channel-pooled attention leading to enhanced attention maps considering the spatial similarity. Finally, our method achieves state-of-the-art localization performance on CUB-200-2011, ImageNet, and OpenImages benchmarks regarding top-1 localization accuracy, MaxBoxAccV2, and PxAP. We also demonstrate the effectiveness of our method in improving segmentation performance measured by mIoU on the PASCAL VOC dataset. © 2021 Elsevier B.V. All rights reserved.
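The two objectives named in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract gives no formulas, so the triplet-style hinge, the Euclidean distance, the `margin` parameter, the mean-squared-difference form of the consistency term, and all function names below are assumptions chosen only to show the pull/push structure the abstract describes.

```python
import math


def l2_distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def contrastive_attention_loss(fg, fg_dropped, bg, margin=1.0):
    """Triplet-style hinge with the dropped foreground feature as anchor.

    Pulls fg_dropped toward fg (positive) and pushes it at least
    `margin` farther from bg (negative). The hinge form and the
    margin are assumptions, not taken from the paper.
    """
    d_pos = l2_distance(fg_dropped, fg)
    d_neg = l2_distance(fg_dropped, bg)
    return max(0.0, d_pos - d_neg + margin)


def foreground_consistency_loss(early_map, late_map):
    """Mean squared difference between two flattened attention maps.

    Stands in for "agreement between layers"; the exact measure used
    in the paper is not stated in the abstract (assumption).
    """
    n = len(early_map)
    return sum((e - l) ** 2 for e, l in zip(early_map, late_map)) / n


if __name__ == "__main__":
    # Background already far from the anchor: no penalty.
    print(contrastive_attention_loss([3.0, 0.0], [0.0, 0.0], [0.0, 4.0]))
    # Background closer to the anchor than the foreground: penalized.
    print(contrastive_attention_loss([3.0, 0.0], [0.0, 0.0], [0.0, 1.0]))
    # Early-layer map disagrees with the late-layer map on one position.
    print(foreground_consistency_loss([0.0, 1.0], [0.0, 0.0]))
```

In this sketch the first loss is zero once the background feature sits at least `margin` farther from the dropped foreground feature than the intact foreground feature does, which mirrors the "draw close, push away" wording; a real model would compute these terms on pooled convolutional features rather than hand-written lists.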
Pages: 244-254
Number of pages: 11
Related papers
50 in total
  • [1] Feature disparity learning for weakly supervised object localization
    Li, Bingfeng
    Ruan, Haohao
    Li, Xinwei
    Wang, Keping
    [J]. IMAGE AND VISION COMPUTING, 2024, 145
  • [2] C2AM: Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation
    Xie, Jinheng
    Xiang, Jianfeng
    Chen, Junliang
    Hou, Xianxu
    Zhao, Xiaodong
    Shen, Linlin
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 979 - 988
  • [4] Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation
    Zhai, Wei
    Wu, Pingyu
    Zhu, Kai
    Cao, Yang
    Wu, Feng
    Zha, Zheng-Jun
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (03) : 750 - 775
  • [5] Weakly-Supervised Domain Adaptive Semantic Segmentation with Prototypical Contrastive Learning
    Das, Anurag
    Xian, Yongqin
    Dai, Dengxin
    Schiele, Bernt
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15434 - 15443
  • [6] A multi-strategy contrastive learning framework for weakly supervised semantic segmentation
    Yuan, Kunhao
    Schaefer, Gerald
    Lai, Yu-Kun
    Wang, Yifan
    Liu, Xiyao
    Guan, Lin
    Fang, Hui
    [J]. PATTERN RECOGNITION, 2023, 137
  • [7] From Weakly Supervised Object Localization to Semantic Segmentation by Probabilistic Image Modeling
    Wilhelm, Thorsten
    Grzeszick, Rene
    Fink, Gernot A.
    Woehler, Christian
    [J]. 2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 119 - 125
  • [8] Weakly supervised object localization and segmentation in videos
    Rochan, Mrigank
    Rahman, Shafin
    Bruce, Neil D. B.
    Wang, Yang
    [J]. IMAGE AND VISION COMPUTING, 2016, 56 : 1 - 12
  • [9] Feature Fusion for Weakly Supervised Object Localization
    Tang, Xu
    Song, Yonghong
    Zhang, Yuanlin
    [J]. 2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 2548 - 2553
  • [10] Anti-Adversarially Manipulated Attributions for Weakly Supervised Semantic Segmentation and Object Localization
    Lee, Jungbeom
    Kim, Eunji
    Mok, Jisoo
    Yoon, Sungroh
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1618 - 1634