Interactive Fusion and Correlation Network for Three-Modal Images Few-Shot Semantic Segmentation

被引:0
|
作者
He, Haolan [1 ]
Dong, Xianguo [2 ]
Zhou, Xiaofei [3 ]
Wang, Bo [3 ]
Zhang, Jiyong [3 ]
机构
[1] Hangzhou Dianzi Univ, Zhuoyue Honors Coll, Hangzhou 310018, Peoples R China
[2] Anhui Construct Engn Qual Supervis & Inspect Stn C, Anhui & Huaihe River Inst Hydraul Res, Hefei 230000, Peoples R China
[3] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金;
关键词
Correlation; Fuses; Decoding; Feature extraction; Convolution; Visualization; Water resources; Few-shot learning; multi-modal feature fusion; semantic segmentation;
D O I
10.1109/LSP.2024.3456634
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This letter presents a novel method for three-modal images few-shot semantic segmentation. Some previous efforts fuse multiple modalities before feature correlation, while this changes the original visual information that is useful to subsequent feature matching. Others are built based on early correlation learning, which can cause details loss and thereby defects multi-modal integration. To address these challenges, we build a novel interactive fusion and correlation network (IFCNet). Specifically, the proposed fusing and correlating (FC) module performs feature correlating and attention-based multi-modal fusing interactively, which establishes effective inter-modal complementarity and benefits intra-modal query-support correlation. Furthermore, we add a multi-modal correlation (MC) module, which leverages multi-layer cosine similarity maps to enrich multi-modal visual correspondence. Experiments on the VDT-2048-5(i) dataset demonstrate the network's superior performance, which outperforms existing state-of-the-art methods in both 1-shot and 5-shot settings. The study also includes an ablation analysis to validate the contributions of the FC module and the MC module to the overall segmentation accuracy.
引用
收藏
页码:2430 / 2434
页数:5
相关论文
共 50 条
  • [1] Self-Enhanced Mixed Attention Network for Three-Modal Images Few-Shot Semantic Segmentation
    Song, Kechen
    Zhang, Yiming
    Bao, Yanqi
    Zhao, Ying
    Yan, Yunhui
    SENSORS, 2023, 23 (14)
  • [2] FFNet: Feature Fusion Network for Few-shot Semantic Segmentation
    Wang, Ya-Nan
    Tian, Xiangtao
    Zhong, Guoqiang
    COGNITIVE COMPUTATION, 2022, 14 (02) : 875 - 886
  • [3] FFNet: Feature Fusion Network for Few-shot Semantic Segmentation
    Ya-Nan Wang
    Xiangtao Tian
    Guoqiang Zhong
    Cognitive Computation, 2022, 14 : 875 - 886
  • [4] Visible and thermal images fusion architecture for few-shot semantic segmentation
    Bao, Yanqi
    Song, Kechen
    Wang, Jie
    Huang, Liming
    Dong, Hongwen
    Yan, Yunhui
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 80
  • [5] Few-Shot Aerial Image Semantic Segmentation Leveraging Pyramid Correlation Fusion
    Ao, Wei
    Zheng, Shunyi
    Meng, Yan
    Gao, Zhi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 12
  • [6] PRIOR SEMANTIC HARMONIZATION NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION
    Yang, Xinhao
    Ma, Liyan
    Zhou, Yang
    Peng, Yan
    Xie, Shaorong
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1126 - 1130
  • [7] A few-shot semantic segmentation method based on adaptively mining correlation network
    Huang, Zhifu
    Jiang, Bin
    Liu, Yu
    ROBOTICA, 2023, 41 (06) : 1828 - 1836
  • [8] Few-Shot and Zero-Shot Semantic Segmentation for Food Images
    Honbu, Yuma
    Yanai, Keiji
    PROCEEDINGS OF THE 13TH INTERNATIONAL WORKSHOP ON MULTIMEDIA FOR COOKING AND EATING ACTIVITIES (CEA '21), 2021, : 25 - 28
  • [9] SANet: similarity aggregation and semantic fusion for few-shot semantic segmentation
    Ye, Minrui
    Zhang, Tao
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [10] Few-Shot Semantic Segmentation with Cyclic Memory Network
    Xie, Guo-Sen
    Xiong, Huan
    Liu, Jie
    Yao, Yazhou
    Shao, Ling
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7273 - 7282