Interactive Fusion and Correlation Network for Three-Modal Images Few-Shot Semantic Segmentation

Cited by: 0
Authors
He, Haolan [1]
Dong, Xianguo [2]
Zhou, Xiaofei [3]
Wang, Bo [3]
Zhang, Jiyong [3]
Affiliations
[1] Hangzhou Dianzi Univ, Zhuoyue Honors Coll, Hangzhou 310018, Peoples R China
[2] Anhui Construct Engn Qual Supervis & Inspect Stn C, Anhui & Huaihe River Inst Hydraul Res, Hefei 230000, Peoples R China
[3] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Correlation; Fuses; Decoding; Feature extraction; Convolution; Visualization; Water resources; Few-shot learning; multi-modal feature fusion; semantic segmentation;
DOI
10.1109/LSP.2024.3456634
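Chinese Library Classification (CLC)
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract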
This letter presents a novel method for few-shot semantic segmentation of three-modal images. Some previous methods fuse the modalities before feature correlation, which alters the original visual information that subsequent feature matching relies on. Others rely on early correlation learning, which can lose detail and thereby impair multi-modal integration. To address these challenges, we build an interactive fusion and correlation network (IFCNet). Specifically, the proposed fusing and correlating (FC) module performs feature correlation and attention-based multi-modal fusion interactively, which establishes effective inter-modal complementarity and benefits intra-modal query-support correlation. Furthermore, we add a multi-modal correlation (MC) module, which leverages multi-layer cosine similarity maps to enrich multi-modal visual correspondence. Experiments on the VDT-2048-5^i dataset show that the network outperforms existing state-of-the-art methods in both 1-shot and 5-shot settings. An ablation analysis validates the contributions of the FC and MC modules to the overall segmentation accuracy.
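As a rough illustration of the "multi-layer cosine similarity maps" mentioned in the abstract, the NumPy sketch below computes, for each query position, the highest cosine similarity to any foreground support position, and stacks one such map per feature layer. This is not the authors' MC module. The function names, the max-over-support reduction, and the assumption that every layer shares the mask's spatial size are all hypothetical choices made for the example.

```python
import numpy as np

def cosine_correlation(query_feat, support_feat, support_mask, eps=1e-8):
    """Hypothetical helper: one cosine similarity map for one feature layer.

    query_feat, support_feat: arrays of shape (C, H, W).
    support_mask: array of shape (H, W), nonzero on the support foreground.
    Returns an (H, W) map holding, per query position, the maximum cosine
    similarity to any foreground support position.
    """
    c, h, w = query_feat.shape
    q = query_feat.reshape(c, -1)                       # (C, H*W)
    fg = support_mask.reshape(-1) > 0
    if not fg.any():
        raise ValueError("support mask has no foreground pixels")
    s = support_feat.reshape(c, -1)[:, fg]              # (C, N_fg)

    # L2-normalise each position's channel vector so dot products are cosines.
    q = q / (np.linalg.norm(q, axis=0, keepdims=True) + eps)
    s = s / (np.linalg.norm(s, axis=0, keepdims=True) + eps)

    sim = q.T @ s                                       # (H*W, N_fg)
    return sim.max(axis=1).reshape(h, w)

def multi_layer_correlation(query_feats, support_feats, support_mask):
    """Stack one similarity map per layer (assumes equal spatial sizes)."""
    maps = [cosine_correlation(q, s, support_mask)
            for q, s in zip(query_feats, support_feats)]
    return np.stack(maps)                               # (L, H, W)
```

In a three-modal setting, one would presumably compute such a stack per modality before combining them; how IFCNet actually does this is described in the letter itself, not here.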
Pages: 2430-2434
Page count: 5