Interactive Fusion and Correlation Network for Three-Modal Images Few-Shot Semantic Segmentation

Cited by: 0
Authors
He, Haolan [1]
Dong, Xianguo [2]
Zhou, Xiaofei [3]
Wang, Bo [3]
Zhang, Jiyong [3]
Affiliations
[1] Hangzhou Dianzi Univ, Zhuoyue Honors Coll, Hangzhou 310018, Peoples R China
[2] Anhui Construct Engn Qual Supervis & Inspect Stn C, Anhui & Huaihe River Inst Hydraul Res, Hefei 230000, Peoples R China
[3] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Correlation; Fuses; Decoding; Feature extraction; Convolution; Visualization; Water resources; Few-shot learning; multi-modal feature fusion; semantic segmentation;
DOI
10.1109/LSP.2024.3456634
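Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code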
0808 ; 0809 ;
Abstract
This letter presents a novel method for few-shot semantic segmentation of three-modal images. Some previous methods fuse the modalities before feature correlation, which alters the original visual information that subsequent feature matching relies on. Others are built on early correlation learning, which can lose detail and thereby impair multi-modal integration. To address these challenges, we build a novel interactive fusion and correlation network (IFCNet). Specifically, the proposed fusing and correlating (FC) module performs feature correlation and attention-based multi-modal fusion interactively, which establishes effective inter-modal complementarity and benefits intra-modal query-support correlation. Furthermore, we add a multi-modal correlation (MC) module, which leverages multi-layer cosine similarity maps to enrich multi-modal visual correspondence. Experiments on the VDT-2048-5^i dataset show that the network outperforms existing state-of-the-art methods in both 1-shot and 5-shot settings. An ablation analysis validates the contributions of the FC and MC modules to overall segmentation accuracy.
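The multi-layer cosine similarity maps mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' MC module: the function names, the modality keys ("visible", "depth", "thermal"), the masking of support features, and the max-over-support reduction are all assumptions made for illustration. It only shows how one cosine similarity map per modality and per layer might be computed between query and support features and then stacked.

```python
import numpy as np


def cosine_correlation(query_feat, support_feat, support_mask, eps=1e-8):
    """Cosine similarity between every query and every support position.

    query_feat, support_feat: arrays of shape (C, H, W).
    support_mask: binary array of shape (H, W) marking the target class.
    Returns an array of shape (H*W, H*W) with negative values clipped to 0.
    """
    c = query_feat.shape[0]
    q = query_feat.reshape(c, -1)
    # Assumption: background support features are zeroed out by the mask.
    s = (support_feat * support_mask[None]).reshape(c, -1)
    # L2-normalise each spatial position along the channel axis.
    q = q / (np.linalg.norm(q, axis=0, keepdims=True) + eps)
    s = s / (np.linalg.norm(s, axis=0, keepdims=True) + eps)
    corr = q.T @ s
    return np.clip(corr, 0.0, None)


def multi_layer_similarity_maps(query_feats, support_feats, support_mask):
    """Stack one similarity map per (modality, layer) pair.

    query_feats, support_feats: dict mapping a modality name to a list of
    (C, H, W) arrays, one per layer, all sharing the same H and W
    (hypothetical layout, chosen for this sketch).
    Returns an array of shape (num_modalities * num_layers, H, W).
    """
    maps = []
    for modality in query_feats:
        for q, s in zip(query_feats[modality], support_feats[modality]):
            corr = cosine_correlation(q, s, support_mask)
            # Assumption: keep the best-matching support position per query pixel.
            maps.append(corr.max(axis=1).reshape(q.shape[1:]))
    return np.stack(maps)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    modalities = ["visible", "depth", "thermal"]  # assumed modality names
    query = {m: [rng.normal(size=(4, 3, 3)) for _ in range(2)] for m in modalities}
    support = {m: [rng.normal(size=(4, 3, 3)) for _ in range(2)] for m in modalities}
    mask = np.ones((3, 3))
    print(multi_layer_similarity_maps(query, support, mask).shape)
```

In a real pipeline the stacked maps would typically be passed to a decoder together with the fused features; that step is omitted here because the abstract does not describe it.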
Pages: 2430-2434
Number of pages: 5
Related papers
50 in total
  • [31] LEARNING WITH MEMORY FOR FEW-SHOT SEMANTIC SEGMENTATION
    Lu, Hongchao
    Wei, Chao
    Deng, Zhidong
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 629 - 633
  • [32] CLIP-Driven Prototype Network for Few-Shot Semantic Segmentation
    Guo, Shi-Cheng
    Liu, Shang-Kun
    Wang, Jing-Yu
    Zheng, Wei-Min
    Jiang, Cheng-Yu
    ENTROPY, 2023, 25 (09)
  • [33] MGNet: Mutual-guidance network for few-shot semantic segmentation
    Chang, Zhaobin
    Lu, Yonggang
    Wang, Xiangwen
    Ran, Xingcheng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
  • [34] DSMF-Net: Dual Semantic Metric Learning Fusion Network for Few-Shot Aerial Image Semantic Segmentation
    Qi, Xiyu
    Zhang, Yidan
    Wang, Lei
    Wu, Yifan
    Xin, Yi
    Chen, Zhan
    Ge, Yunping
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 853 - 864
  • [35] Multiscale Attention-Based Prototypical Network For Few-Shot Semantic Segmentation
    Zhang, Yifei
    Sidibe, Desire
    Morel, Olivier
    Meriaudeau, Fabrice
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7372 - 7378
  • [36] A SIMILARITY DISTILLATION GUIDED FEATURE REFINEMENT NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION
    Lyu, Shuchang
    Liu, Binghao
    Chen, Lijiang
    Zhao, Qi
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 666 - 670
  • [37] Found missing semantics: Supplemental prototype network for few-shot semantic segmentation
    Liang, Chen
    Bai, Shuang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
  • [38] Dual-Guided Frequency Prototype Network for Few-Shot Semantic Segmentation
    Wen, Chunlin
    Huang, Hui
    Ma, Yan
    Yuan, Feiniu
    Zhu, Hongqing
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8874 - 8888
  • [39] ARNET: ATTENTION-BASED REFINEMENT NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION
    Li, Rusheng
    Liu, Hanhui
    Zhu, Yuesheng
    Bai, Zhiqiang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2238 - 2242
  • [40] Scale-Aware Graph Neural Network for Few-Shot Semantic Segmentation
    Xie, Guo-Sen
    Liu, Jie
    Xiong, Huan
    Shao, Ling
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5471 - 5480