APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic Segmentation

被引:19
|
作者
Chen, Jiacheng [1 ]
Gao, Bin-Bin [2 ]
Lu, Zongqing [1 ]
Xue, Jing-Hao [3 ]
Wang, Chengjie [2 ]
Liao, Qingmin [1 ]
机构
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China
[2] Tencent YouTu Lab, Shenzhen 518057, Peoples R China
[3] UCL, Dept Stat Sci, London WC1E 6BT, England
关键词
Contrastive learning; few-shot learning; metric learning; self-supervised learning; semantic segmentation;
D O I
10.1109/TMM.2022.3174405
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Few-shot semantic segmentation aims to segment novel-class objects in a given query image with only a few labeled support images. Most advanced solutions exploit a metric learning framework that performs segmentation through matching each query feature to a learned class-specific prototype. However, this framework suffers from biased classification due to incomplete feature comparisons. To address this issue, we present an adaptive prototype representation by introducing class-specific and class-agnostic prototypes and thus construct complete sample pairs for learning semantic alignment with query features. The complementary features learning manner effectively enriches feature comparison and helps yield an unbiased segmentation model in the few-shot setting. It is implemented with a two-branch end-to-end network (i.e., a class-specific branch and a class-agnostic branch), which generates prototypes and then combines query features to perform comparisons. In addition, the proposed class-agnostic branch is simple yet effective. In practice, it can adaptively generate multiple class-agnostic prototypes for query images and learn feature alignment in a self-contrastive manner. Extensive experiments on PASCAL-5(i) and COCO-20(i) demonstrate the superiority of our method. At no expense of inference efficiency, our model achieves state-of-the-art results in both 1-shot and 5-shot settings for semantic segmentation.
引用
收藏
页码:4361 / 4373
页数:13
相关论文
共 50 条
  • [1] Learning Orthogonal Prototypes for Generalized Few-shot Semantic Segmentation
    Liu, Sun-Ao
    Zhang, Yiheng
    Qiu, Zhaofan
    Xie, Hongtao
    Zhang, Yongdong
    Yao, Ting
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11319 - 11328
  • [2] PRIOR SEMANTIC HARMONIZATION NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION
    Yang, Xinhao
    Ma, Liyan
    Zhou, Yang
    Peng, Yan
    Xie, Shaorong
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1126 - 1130
  • [3] PANet: Few-Shot Image Semantic Segmentation with Prototype Alignment
    Wang, Kaixin
    Liew, Jun Hao
    Zou, Yingtian
    Zhou, Daquan
    Feng, Jiashi
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9196 - 9205
  • [4] Few-Shot Semantic Segmentation with Cyclic Memory Network
    Xie, Guo-Sen
    Xiong, Huan
    Liu, Jie
    Yao, Yazhou
    Shao, Ling
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7273 - 7282
  • [5] Deep Reasoning Network for Few-shot Semantic Segmentation
    Zhuge, Yunzhi
    Shen, Chunhua
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5344 - 5352
  • [6] Exploring Hierarchical Prototypes for Few-Shot Segmentation
    Chen, Yaozong
    Cao, Wenming
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 42 - 53
  • [7] Learning prototypes from background and latent objects for few-shot semantic segmentation
    Wang, Yicong
    Huang, Rong
    Zhou, Shubo
    Jiang, Xueqin
    Fang, Zhijun
    KNOWLEDGE-BASED SYSTEMS, 2025, 314
  • [8] Generalized Few-shot Semantic Segmentation
    Tian, Zhuotao
    Lai, Xin
    Jiang, Li
    Liu, Shu
    Shu, Michelle
    Zhao, Hengshuang
    Jia, Jiaya
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11553 - 11562
  • [9] A Transformer-based Adaptive Prototype Matching Network for Few-Shot Semantic Segmentation
    Chen, Sihan
    Chen, Yadang
    Zheng, Yuhui
    Yang, Zhi-Xin
    Wu, Enhua
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 659 - 667
  • [10] Dynamic Prototype Convolution Network for Few-Shot Semantic Segmentation
    Liu, Jie
    Bao, Yanqi
    Xie, Guo-Sen
    Xiong, Huan
    Sonke, Jan-Jakob
    Gavves, Efstratios
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11543 - 11552