Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Cited by: 273
Authors
Wang, Wenguan [1]
Zhou, Tianfei [1]
Yu, Fisher [1]
Dai, Jifeng [2]
Konukoglu, Ender [1]
Van Gool, Luc [1]
Affiliations
[1] Swiss Federal Institute of Technology (ETH Zurich), Computer Vision Lab, Zurich, Switzerland
[2] SenseTime Research, Hong Kong, People's Republic of China
DOI
10.1109/ICCV48922.2021.00721
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Current semantic segmentation methods focus only on mining "local" context, i.e., dependencies between pixels within individual images, through context-aggregation modules (e.g., dilated convolution, neural attention) or structure-aware optimization criteria (e.g., IoU-like loss). However, they ignore the "global" context of the training data, i.e., rich semantic relations between pixels across different images. Inspired by recent advances in unsupervised contrastive representation learning, we propose a pixel-wise contrastive algorithm for semantic segmentation in the fully supervised setting. The core idea is to enforce pixel embeddings belonging to the same semantic class to be more similar than embeddings from different classes. This gives rise to a pixel-wise metric learning paradigm for semantic segmentation that explicitly explores the structures of labeled pixels, which have rarely been explored before. Our method can be effortlessly incorporated into existing segmentation frameworks without extra overhead during testing. We experimentally show that, with widely used segmentation models (i.e., DeepLabV3, HRNet, OCR) and backbones (i.e., ResNet, HRNet), our method brings performance improvements across diverse datasets (i.e., Cityscapes, PASCAL-Context, COCO-Stuff, CamVid). We expect this work will encourage our community to rethink the current de facto training paradigm in semantic segmentation.
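The core idea in the abstract (pixel embeddings of the same class should be more similar than embeddings of different classes) can be illustrated with a small NumPy sketch. This is a generic supervised, InfoNCE-style contrastive loss over pixel embeddings, not necessarily the exact loss used in the paper; the function name `pixel_contrast_loss`, the `temperature` default, and the choice to put every other pixel in the softmax denominator are assumptions made for illustration.

```python
import numpy as np


def pixel_contrast_loss(embeddings, labels, temperature=0.1):
    """Illustrative supervised contrastive loss over pixel embeddings.

    embeddings: (N, D) array, one row per sampled pixel; the pixels may be
                drawn from several images (the "cross-image" part).
    labels:     (N,) integer array with the semantic class of each pixel.
    Assumes N >= 2 and that at least one pixel has a same-class partner.
    """
    # L2-normalise so that dot products are cosine similarities.
    z = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    logits = z @ z.T / temperature

    n = labels.shape[0]
    not_self = ~np.eye(n, dtype=bool)
    # Positives: other pixels that share the anchor's class.
    positives = (labels[:, None] == labels[None, :]) & not_self

    # Log-softmax over all other pixels (the anchor itself is masked out).
    logits = np.where(not_self, logits, -np.inf)
    row_max = logits.max(axis=1, keepdims=True)
    shifted = logits - row_max
    log_prob = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))

    # Average the negative log-probability over each anchor's positives,
    # skipping anchors that have no positive in the batch.
    pos_count = positives.sum(axis=1)
    valid = pos_count > 0
    pos_log_prob = np.where(positives, log_prob, 0.0).sum(axis=1)
    per_anchor = -pos_log_prob[valid] / pos_count[valid]
    return per_anchor.mean()


# Toy usage: four 2-D "pixel" embeddings forming two tight clusters.
emb = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
loss_matching = pixel_contrast_loss(emb, np.array([0, 0, 1, 1]))
loss_mismatched = pixel_contrast_loss(emb, np.array([0, 1, 0, 1]))
```

In this toy example the loss is lower when the labels agree with the clusters in embedding space than when they cut across them, which is the behaviour the abstract describes. In a segmentation model such a term would typically be added to the usual per-pixel cross-entropy loss during training only, consistent with the abstract's statement that there is no extra overhead at test time.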
Pages: 7283-7293
Page count: 11
Related papers
50 in total
  • [1] Cross-Image Pixel Contrasting for Semantic Segmentation
    Zhou, Tianfei
    Wang, Wenguan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5398 - 5412
  • [2] Cross-Image Relational Knowledge Distillation for Semantic Segmentation
    Yang, Chuanguang
    Zhou, Helong
    An, Zhulin
    Jiang, Xue
    Xu, Yongjun
    Zhang, Qian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12309 - 12318
  • [3] CIAN: Cross-Image Affinity Net for Weakly Supervised Semantic Segmentation
    Fan, Junsong
    Zhang, Zhaoxiang
    Tan, Tieniu
    Song, Chunfeng
    Xiao, Jun
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10762 - 10769
  • [4] Weakly Supervised Semantic Segmentation in Aerial Imagery via Cross-Image Semantic Mining
    Zhou, Ruixue
    Yuan, Zhiqiang
    Rong, Xuee
    Ma, Weicong
    Sun, Xian
    Fu, Kun
    Zhang, Wenkai
    REMOTE SENSING, 2023, 15 (04)
  • [5] Memory-Based Cross-Image Contexts for Weakly Supervised Semantic Segmentation
    Fan, Junsong
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6006 - 6020
  • [6] Fuzzy-based cross-image pixel contrastive learning for compact medical image segmentation
    Yecong Wan
    Mingwen Shao
    Yuanshuo Cheng
    Weiping Ding
    Multimedia Tools and Applications, 2024, 83 : 30377 - 30397
  • [7] Fuzzy-based cross-image pixel contrastive learning for compact medical image segmentation
    Wan, Yecong
    Shao, Mingwen
    Cheng, Yuanshuo
    Ding, Weiping
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (10) : 30377 - 30397
  • [8] Querying Labeled for Unlabeled: Cross-Image Semantic Consistency Guided Semi-Supervised Semantic Segmentation
    Wu, Linshan
    Fang, Leyuan
    He, Xingxin
    He, Min
    Ma, Jiayi
    Zhong, Zhun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8827 - 8844
  • [9] Mining semantic information from intra-image and cross-image for few-shot segmentation
    Yu Liu
    Yingchun Guo
    Ye Zhu
    Ming Yu
    Multimedia Tools and Applications, 2022, 81 : 18305 - 18326
  • [10] Mining semantic information from intra-image and cross-image for few-shot segmentation
    Liu, Yu
    Guo, Yingchun
    Zhu, Ye
    Yu, Ming
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (13) : 18305 - 18326