Target-Aware Bi-Transformer for Few-Shot Segmentation

被引:0
|
作者
Wang, Xianglin [1 ]
Luo, Xiaoliu [1 ]
Zhang, Taiping [1 ]
机构
[1] Chongqing Univ, Chongqing, Peoples R China
关键词
Semantic segmentation; Few-shot learning; Transformer;
D O I
10.1007/978-981-99-8432-9_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional semantic segmentation tasks require a large number of labels and are difficult to identify unlearned categories. Few-shot semantic segmentation (FSS) aims to use limited labeled support images to identify the segmentation of new classes of objects, which is very practical in the real world. Previous researches were primarily based on prototypes or correlations. Due to colors, textures, and styles are similar in the same image, we argue that the query image can be regarded as its own support image. In this paper, we proposed the Target-aware Bi-Transformer Network (TBTNet) to equivalent treat of support images and query image. A vigorous Target-aware Transformer Layer (TTL) also be designed to distill correlations and force the model to focus on fore-ground information. It treats the hypercorrelation as a feature, resulting a significant reduction in the number of feature channels. Benefit from this characteristic, our model is the lightest up to now with only 0.4M learnable parameters. Furthermore, TBTNet converges in only 10% to 25% of the training epochs compared to traditional methods. The excellent performance on standard FSS benchmarks of PASCAL-5(i) and COCO-20(i) proves the efficiency of our method. Extensive ablation studies were also carried out to evaluate the effectiveness of Bi-Transformer architecture and TTL.
引用
收藏
页码:440 / 452
页数:13
相关论文
共 50 条
  • [1] Target-aware for Few-shot Segmentation
    Luo, XiaoLiu
    Zhang, Taiping
    Duan, Zhao
    Tan, Jin
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [2] Fast target-aware learning for few-shot video object segmentation
    Yadang CHEN
    Chuanyan HAO
    Zhi-Xin YANG
    Enhua WU
    [J]. Science China(Information Sciences), 2022, 65 (08) : 71 - 86
  • [3] Fast target-aware learning for few-shot video object segmentation
    Chen, Yadang
    Hao, Chuanyan
    Yang, Zhi-Xin
    Wu, Enhua
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (08)
  • [4] Fast target-aware learning for few-shot video object segmentation
    Yadang Chen
    Chuanyan Hao
    Zhi-Xin Yang
    Enhua Wu
    [J]. Science China Information Sciences, 2022, 65
  • [5] Few-Shot Stance Detection via Target-Aware Prompt Distillation
    Jiang, Yan
    Gao, Jinhua
    Shen, Huawei
    Cheng, Xueqi
    [J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 837 - 847
  • [6] Incorporating target-aware knowledge into prompt-tuning for few-shot stance detection
    Wang, Shaokang
    Sun, Fuhui
    Wang, Xiaoyan
    Pan, Li
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (05)
  • [7] Adaptive Agent Transformer for Few-Shot Segmentation
    Wang, Yuan
    Sun, Rui
    Zhang, Zhe
    Zhang, Tianzhu
    [J]. COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 36 - 52
  • [8] Dynamic Transformer for Few-shot Instance Segmentation
    Wang, Haochen
    Liu, Jie
    Liu, Yongtuo
    Maji, Subhransu
    Sonke, Jan-Jakob
    Gavves, Efstratios
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2969 - 2977
  • [9] Mask Matching Transformer for Few-Shot Segmentation
    Jiao, Siyu
    Zhang, Gengwei
    Navasardyan, Shant
    Chen, Ling
    Zhao, Yao
    Wei, Yunchao
    Shi, Humphrey
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [10] A lightweight siamese transformer for few-shot semantic segmentation
    Zhu, Hegui
    Zhou, Yange
    Jiang, Cong
    Yang, Lianping
    Jiang, Wuming
    Wang, Zhimu
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (13): : 7455 - 7469