APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic Segmentation

被引：19

作者：

Chen, Jiacheng ^{[1
]}

Gao, Bin-Bin ^{[2
]}

Lu, Zongqing ^{[1
]}

Xue, Jing-Hao ^{[3
]}

Wang, Chengjie ^{[2
]}

Liao, Qingmin ^{[1
]}

机构：

[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China

[2] Tencent YouTu Lab, Shenzhen 518057, Peoples R China

[3] UCL, Dept Stat Sci, London WC1E 6BT, England

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2023年 / 25卷

关键词：

Contrastive learning; few-shot learning; metric learning; self-supervised learning; semantic segmentation;

D O I：

10.1109/TMM.2022.3174405

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Few-shot semantic segmentation aims to segment novel-class objects in a given query image with only a few labeled support images. Most advanced solutions exploit a metric learning framework that performs segmentation through matching each query feature to a learned class-specific prototype. However, this framework suffers from biased classification due to incomplete feature comparisons. To address this issue, we present an adaptive prototype representation by introducing class-specific and class-agnostic prototypes and thus construct complete sample pairs for learning semantic alignment with query features. The complementary features learning manner effectively enriches feature comparison and helps yield an unbiased segmentation model in the few-shot setting. It is implemented with a two-branch end-to-end network (i.e., a class-specific branch and a class-agnostic branch), which generates prototypes and then combines query features to perform comparisons. In addition, the proposed class-agnostic branch is simple yet effective. In practice, it can adaptively generate multiple class-agnostic prototypes for query images and learn feature alignment in a self-contrastive manner. Extensive experiments on PASCAL-5(i) and COCO-20(i) demonstrate the superiority of our method. At no expense of inference efficiency, our model achieves state-of-the-art results in both 1-shot and 5-shot settings for semantic segmentation.

引用

页码：4361 / 4373

页数：13

共 50 条

[1] Learning Orthogonal Prototypes for Generalized Few-shot Semantic Segmentation
Liu, Sun-Ao
Zhang, Yiheng
Qiu, Zhaofan
Xie, Hongtao
Zhang, Yongdong
Yao, Ting
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11319 - 11328
[2] PRIOR SEMANTIC HARMONIZATION NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION
Yang, Xinhao
Ma, Liyan
Zhou, Yang
Peng, Yan
Xie, Shaorong
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1126 - 1130
[3] PANet: Few-Shot Image Semantic Segmentation with Prototype Alignment
Wang, Kaixin
Liew, Jun Hao
Zou, Yingtian
Zhou, Daquan
Feng, Jiashi
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9196 - 9205
[4] Few-Shot Semantic Segmentation with Cyclic Memory Network
Xie, Guo-Sen
Xiong, Huan
Liu, Jie
Yao, Yazhou
Shao, Ling
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7273 - 7282
[5] Deep Reasoning Network for Few-shot Semantic Segmentation
Zhuge, Yunzhi
Shen, Chunhua
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5344 - 5352
[6] Exploring Hierarchical Prototypes for Few-Shot Segmentation
Chen, Yaozong
Cao, Wenming
ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 42 - 53
[7] Learning prototypes from background and latent objects for few-shot semantic segmentation
Wang, Yicong
Huang, Rong
Zhou, Shubo
Jiang, Xueqin
Fang, Zhijun
KNOWLEDGE-BASED SYSTEMS, 2025, 314
[8] Generalized Few-shot Semantic Segmentation
Tian, Zhuotao
Lai, Xin
Jiang, Li
Liu, Shu
Shu, Michelle
Zhao, Hengshuang
Jia, Jiaya
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11553 - 11562
[9] A Transformer-based Adaptive Prototype Matching Network for Few-Shot Semantic Segmentation
Chen, Sihan
Chen, Yadang
Zheng, Yuhui
Yang, Zhi-Xin
Wu, Enhua
PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 659 - 667
[10] Dynamic Prototype Convolution Network for Few-Shot Semantic Segmentation
Liu, Jie
Bao, Yanqi
Xie, Guo-Sen
Xiong, Huan
Sonke, Jan-Jakob
Gavves, Efstratios
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11543 - 11552

← 1 2 3 4 5 →