C2AM: Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation

被引:50
|
作者
Xie, Jinheng
Xiang, Jianfeng
Chen, Junliang
Hou, Xianxu
Zhao, Xiaodong
Shen, Linlin [1 ]
机构
[1] Shenzhen Univ, Sch Comp Sci & Software Engn, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52688.2022.00106
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While class activation map (CAM) generated by image classification network has been widely used for weakly supervised object localization (WSOL) and semantic segmentation (WSSS), such classifiers usually focus on discriminative object regions. In this paper, we propose Contrastive learning for Class-agnostic Activation Map (C(2)AM) generation only using unlabeled image data, without the involvement of image-level supervision. The core idea comes from the observation that i) semantic information of foreground objects usually differs from their backgrounds; ii) foreground objects with similar appearance or background with similar color/texture have similar representations in the feature space. We form the positive and negative pairs based on the above relations and force the network to disentangle foreground and background with a class-agnostic activation map using a novel contrastive loss. As the network is guided to discriminate cross-image foreground-background, the class-agnostic activation maps learned by our approach generate more complete object regions. We successfully extracted from C(2)AM class-agnostic object bounding boxes for object localization and background cues to refine CAM generated by classification network for semantic segmentation. Extensive experiments on CUB-200-2011, ImageNet-1K, and PASCAL VOC2012 datasets show that both WSOL and WSSS can benefit from the proposed C(2)AM. Code will be available at https://github.com/CVISZU/CCAM.
引用
收藏
页码:979 / 988
页数:10
相关论文
共 50 条
  • [1] Learning class-agnostic masks with cross-task refinement for weakly supervised semantic segmentation
    Lian Xu
    Mohammed Bennamoun
    Farid Boussaid
    Wanli Ouyang
    Dan Xu
    [J]. Neural Computing and Applications, 2023, 35 : 20189 - 20205
  • [2] Learning class-agnostic masks with cross-task refinement for weakly supervised semantic segmentation
    Xu, Lian
    Bennamoun, Mohammed
    Boussaid, Farid
    Ouyang, Wanli
    Xu, Dan
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 20189 - 20205
  • [3] Contrastive and consistent feature learning for weakly supervised object localization and semantic segmentation
    Ki, Minsong
    Uh, Youngjung
    Lee, Wonyoung
    Byun, Hyeran
    [J]. NEUROCOMPUTING, 2021, 445 : 244 - 254
  • [4] Weakly-supervised Instance Segmentation via Class-agnostic Learning with Salient Images
    Wang, Xinggang
    Feng, Jiapei
    Hu, Bin
    Ding, Qi
    Ran, Longjin
    Chen, Xiaoxin
    Liu, Wenyu
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10220 - 10230
  • [5] IMPROVING CLASS ACTIVATION MAP FOR WEAKLY SUPERVISED OBJECT LOCALIZATION
    Zhang, Zhenfei
    Chang, Ming-Ching
    But, Tien D.
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2624 - 2628
  • [6] Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation
    Wei Zhai
    Pingyu Wu
    Kai Zhu
    Yang Cao
    Feng Wu
    Zheng-Jun Zha
    [J]. International Journal of Computer Vision, 2024, 132 (3) : 750 - 775
  • [7] Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation
    Zhai, Wei
    Wu, Pingyu
    Zhu, Kai
    Cao, Yang
    Wu, Feng
    Zha, Zheng-Jun
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (03) : 750 - 775
  • [8] Adversarial Learning of Object-Aware Activation Map for Weakly-Supervised Semantic Segmentation
    Chen, Junliang
    Lu, Weizeng
    Li, Yuexiang
    Shen, Linlin
    Duan, Jinming
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 3935 - 3946
  • [9] FreqCAM: Frequent Class Activation Map for Weakly Supervised Object Localization
    Zhang, Runsheng
    [J]. PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 677 - 680
  • [10] ICA-Net: improving class activation for weakly supervised semantic segmentation via joint contrastive and simulation learning
    YE Zhuang
    LIU Ruyu
    SUN Bo
    [J]. Optoelectronics Letters., 2025, 21 (03) - 192