Interactive Fusion and Correlation Network for Three-Modal Images Few-Shot Semantic Segmentation

被引:0
|
作者
He, Haolan [1 ]
Dong, Xianguo [2 ]
Zhou, Xiaofei [3 ]
Wang, Bo [3 ]
Zhang, Jiyong [3 ]
机构
[1] Hangzhou Dianzi Univ, Zhuoyue Honors Coll, Hangzhou 310018, Peoples R China
[2] Anhui Construct Engn Qual Supervis & Inspect Stn C, Anhui & Huaihe River Inst Hydraul Res, Hefei 230000, Peoples R China
[3] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金;
关键词
Correlation; Fuses; Decoding; Feature extraction; Convolution; Visualization; Water resources; Few-shot learning; multi-modal feature fusion; semantic segmentation;
D O I
10.1109/LSP.2024.3456634
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This letter presents a novel method for three-modal images few-shot semantic segmentation. Some previous efforts fuse multiple modalities before feature correlation, while this changes the original visual information that is useful to subsequent feature matching. Others are built based on early correlation learning, which can cause details loss and thereby defects multi-modal integration. To address these challenges, we build a novel interactive fusion and correlation network (IFCNet). Specifically, the proposed fusing and correlating (FC) module performs feature correlating and attention-based multi-modal fusing interactively, which establishes effective inter-modal complementarity and benefits intra-modal query-support correlation. Furthermore, we add a multi-modal correlation (MC) module, which leverages multi-layer cosine similarity maps to enrich multi-modal visual correspondence. Experiments on the VDT-2048-5(i) dataset demonstrate the network's superior performance, which outperforms existing state-of-the-art methods in both 1-shot and 5-shot settings. The study also includes an ablation analysis to validate the contributions of the FC module and the MC module to the overall segmentation accuracy.
引用
收藏
页码:2430 / 2434
页数:5
相关论文
共 50 条
  • [21] Bimodal semantic fusion prototypical network for few-shot classification
    Huang, Xilang
    Choi, Seon Han
    INFORMATION FUSION, 2024, 109
  • [22] Quaternion-Valued Correlation Learning for Few-Shot Semantic Segmentation
    Zheng, Zewen
    Huang, Guoheng
    Yuan, Xiaochen
    Pun, Chi-Man
    Liu, Hongrui
    Ling, Wing-Kuen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) : 2102 - 2115
  • [23] Query-support semantic correlation mining for few-shot segmentation
    Shao, Ji
    Gong, Bo
    Dai, Kanyuan
    Li, Daoliang
    Jing, Ling
    Chen, Yingyi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [24] Few-shot Segmentation and Semantic Segmentation for Underwater Imagery
    Kabir, Imran
    Shaurya, Shubham
    Maigur, Vijayalaxmi
    Thakurdesai, Nikhil
    Latnekar, Mahesh
    Raunak, Mayank
    Crandall, David
    Reza, Md Alimoor
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 11451 - 11457
  • [25] EPFNet: Edge-Prototype Fusion Network Toward Few-Shot Semantic Segmentation for Aerial Remote-Sensing Images
    Wu, Jiayi
    Qin, Chuan
    Ren, Yanli
    Feng, Guorui
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [26] POEM: A prototype cross and emphasis network for few-shot semantic segmentation
    Cheng, Xu
    Li, Haoyuan
    Deng, Shuya
    Peng, Yonghong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 234
  • [27] Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation
    Bao, Xiaoyi
    Qin, Jie
    Sun, Siyang
    Wang, Xingang
    Zheng, Yun
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 765 - 773
  • [28] Few-Shot Semantic Segmentation via Frequency Guided Neural Network
    Rao, Xiya
    Lu, Tao
    Wang, Zhongyuan
    Zhang, Yanduo
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1092 - 1096
  • [29] Self-regularized prototypical network for few-shot semantic segmentation
    Ding, Henghui
    Zhang, Hui
    Jiang, Xudong
    PATTERN RECOGNITION, 2023, 133
  • [30] APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic Segmentation
    Chen, Jiacheng
    Gao, Bin-Bin
    Lu, Zongqing
    Xue, Jing-Hao
    Wang, Chengjie
    Liao, Qingmin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4361 - 4373