Multiscale deep feature selection fusion network for referring image segmentation

Authors
Dai, Xianwen [1]
Lin, Jiacheng [1]
Nai, Ke [1]
Li, Qingpeng [2]
Li, Zhiyong [1]
Affiliations
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Peoples R China
[2] Hunan Univ, Sch Robot, Changsha, Hunan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Referring image segmentation; Semantic segmentation; Multi-modal fusion; Deep learning;
DOI
10.1007/s11042-023-16913-6
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Referring image segmentation has attracted extensive attention in recent years. Previous methods have explored the difficult alignment between visual and textual features, but the problem has not been effectively addressed, and the resulting insufficient interaction between the two modalities degrades model performance. To this end, we propose a language-aware pixel feature fusion module (LPFFM) based on the self-attention mechanism, which ensures that the features of the two modalities interact sufficiently across both the spatial and channel dimensions. We apply it from the shallow to the deep layers of the encoder to gradually select visual features related to the text. We then propose a second selection mechanism to further select visual features that contain only the target, and for this mechanism we design an attention contrastive loss to better suppress irrelevant background information. Building on these components, we propose a multi-scale deep feature selection fusion network (MDSFNet) based on the U-Net architecture. Experimental results show that the proposed method is competitive with previous methods, improving performance by 2.87%, 3.17%, and 3.81% on the three benchmark datasets RefCOCO, RefCOCO+, and G-ref, respectively.
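The abstract does not give the internals of LPFFM, so the following is only a minimal illustrative sketch of the general idea it names: attention-based fusion in which each pixel attends to the words of the expression (spatial interaction) and a sentence-level gate reweights channels (channel interaction). The function name `language_aware_fusion`, all weight matrices, the sigmoid gate, and the residual addition are assumptions made for illustration; they are not the authors' implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def language_aware_fusion(visual, words, w_q, w_k, w_v, w_gate):
    """Hypothetical pixel-word fusion (not the paper's LPFFM).

    visual: (N, C) flattened pixel features, N = H * W
    words:  (T, D) word features of the referring expression
    """
    q = visual @ w_q                      # (N, d) pixel queries
    k = words @ w_k                       # (T, d) word keys
    v = words @ w_v                       # (T, C) word values
    # Spatial interaction: each pixel attends over all words.
    attn = softmax(q @ k.T / np.sqrt(q.shape[-1]), axis=-1)  # (N, T)
    lang_per_pixel = attn @ v             # (N, C)
    # Channel interaction (assumed form): a sigmoid gate computed
    # from the mean-pooled sentence feature scales each channel.
    gate = 1.0 / (1.0 + np.exp(-(words.mean(axis=0) @ w_gate)))  # (C,)
    fused = visual + gate * lang_per_pixel  # residual fusion
    return fused, attn

# Toy usage with random features and weights.
rng = np.random.default_rng(0)
H, W, C, T, D, d = 4, 4, 8, 5, 6, 8
visual = rng.normal(size=(H * W, C))
words = rng.normal(size=(T, D))
w_q = rng.normal(size=(C, d))
w_k = rng.normal(size=(D, d))
w_v = rng.normal(size=(D, C))
w_gate = rng.normal(size=(D, C))
fused, attn = language_aware_fusion(visual, words, w_q, w_k, w_v, w_gate)
print(fused.shape, attn.shape)
```

In an encoder of the kind the abstract describes, a module like this would be applied once per stage so that text-related visual features are selected progressively from shallow to deep layers; the second selection mechanism and the attention contrastive loss are not sketched here because the abstract does not specify them.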
Pages: 36287-36305
Number of pages: 19