Contrastive and consistent feature learning for weakly supervised object localization and semantic segmentation

Cited by: 4

Authors:

Ki, Minsong [1]

Uh, Youngjung [2]

Lee, Wonyoung [3]

Byun, Hyeran [1,3]

Affiliations:

[1] Yonsei Univ, Dept Comp Sci, Seoul, South Korea

[2] Yonsei Univ, Dept Appl Informat Engn, Seoul, South Korea

[3] Yonsei Univ, Grad Sch Artificial Intelligence, Seoul, South Korea

Source:

NEUROCOMPUTING | 2021 / Volume 445

Funding:

National Research Foundation of Singapore;

Keywords:

Weakly supervised learning; Localization; Segmentation; Contrastive learning; Foreground consistency;

DOI:

10.1016/j.neucom.2021.03.023

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Weakly supervised learning attempts to construct predictive models by learning with weak supervision. In this paper, we concentrate on weakly supervised object localization and semantic segmentation tasks. Existing methods are limited to focusing on narrow discriminative parts or overextending the activations to less discriminative regions even on backgrounds. To mitigate these problems, we regard the background as an important cue that guides the feature activation to cover the entire object to the right extent, and propose two novel objective functions: 1) contrastive attention loss and 2) foreground consistency loss. Contrastive attention loss draws the foreground feature and its dropped version close together and pushes the dropped foreground feature away from the background feature. Foreground consistency loss favors agreement between layers and provides early layers with a sense of objectness. Using both losses leads to balanced improvements over localization and segmentation accuracy by boosting activations on less discriminative regions but restraining the activation in the target object extent. For better optimizing the above losses, we use the non-local attention blocks to replace channel-pooled attention leading to enhanced attention maps considering the spatial similarity. Finally, our method achieves state-of-the-art localization performance on CUB-200-2011, ImageNet, and OpenImages benchmarks regarding top-1 localization accuracy, MaxBoxAccV2, and PxAP. We also demonstrate the effectiveness of our method in improving segmentation performance measured by mIoU on the PASCAL VOC dataset. (C) 2021 Elsevier B.V. All rights reserved.
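
The contrastive attention loss described in the abstract can be illustrated with a small triplet-style sketch. This is not the paper's formulation: the masked average pooling, the `drop_threshold` rule for producing the "dropped" foreground, the Euclidean distance, and the `margin` value are all assumptions chosen only to show the pull/push structure (dropped foreground pulled toward the foreground, pushed away from the background).

```python
import numpy as np


def masked_avg_pool(features, mask):
    """Pool a (C, H, W) feature map into a (C,) vector, weighting positions by an (H, W) mask."""
    weights = mask / (mask.sum() + 1e-8)  # normalise so the weights sum to ~1
    return (features * weights[None, :, :]).sum(axis=(1, 2))


def contrastive_attention_loss(features, attention, drop_threshold=0.8, margin=1.0):
    """Triplet-style sketch (assumed form, not the authors' exact loss).

    anchor   = dropped foreground feature
    positive = foreground feature
    negative = background feature
    """
    foreground_mask = attention
    background_mask = 1.0 - attention
    # Hypothetical drop rule: erase the most strongly attended positions.
    dropped_mask = np.where(attention > drop_threshold, 0.0, attention)

    foreground = masked_avg_pool(features, foreground_mask)
    background = masked_avg_pool(features, background_mask)
    dropped_foreground = masked_avg_pool(features, dropped_mask)

    dist_positive = np.linalg.norm(dropped_foreground - foreground)
    dist_negative = np.linalg.norm(dropped_foreground - background)
    # Hinge: zero once the background is at least `margin` farther than the foreground.
    return float(max(0.0, dist_positive - dist_negative + margin))
```

Calling `contrastive_attention_loss(features, attention)` with a `(C, H, W)` feature array and an `(H, W)` attention map in `[0, 1]` returns a non-negative scalar. In practice such a loss would be computed on network activations inside an automatic-differentiation framework rather than on plain NumPy arrays.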

Pages: 244-254

Page count: 11

