Entity Semantic Feature Fusion Network for Remote Sensing Image-Text Retrieval

被引:0
|
作者
Shui, Jianan [1 ]
Ding, Shuaipeng [1 ]
Li, Mingyong [1 ]
Ma, Yan [1 ]
机构
[1] Chongqing Normal Univ, Sch Comp & Informat Sci, Chongqing 401331, Peoples R China
来源
关键词
Remote Sensing; Image-Text Retrieval; Entity Semantic;
D O I
10.1007/978-981-97-7244-5_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, there has been remarkable progress in remote sensing image-text retrieval (RSITR), but in the past RSITR methods, researchers often try to extract features in images and texts from global and local perspectives, and the unique entity semantic contained in remote sensing images and texts rarely paid attention to, or even ignored. In this paper, we propose an Entity Semantic feature Fusion Network (ESFN), which uses the entity semantic in remote sensing images and texts to enhance the alignment degree and improve the retrieval accuracy. In the visual part, we propose a Scene Entity Filtering module (SEF), which can effectively extract significant entity semantic features from low-level feature maps. The Multi-level Adaptive Fusion module (MAF) adaptively selects the information of image features at different levels for feature fusion. In the textual part, we embed the entity semantic in the text into our textual feature extractor, so that it can have a good entity perception of remote sensing text. We designed a Text Phrase Enhancement module (TPE) to further extract and enhance entity semantic and alignment visual information in text. In addition, ESFN's experimental results on RSICD and RSITMD datasets show that R@1 and meanRecall (mR) reach 8.14, 22.16, 18.81 and 37.70 respectively, which verifies the model's perception of entity semantic in remote sensing images and texts. Through performance comparison, ablation study and visualization analysis, the effectiveness and superiority of this method are verified.
引用
收藏
页码:130 / 145
页数:16
相关论文
共 50 条
  • [41] Image-Text Fusion Sentiment Analysis Method Based on Image Semantic Translation
    Huang, Jian
    Wang, Ying
    [J]. Computer Engineering and Applications, 2023, 59 (11) : 180 - 187
  • [42] Multi-label semantic feature fusion for remote sensing image captioning
    Wang, Shuang
    Ye, Xiutiao
    Gu, Yu
    Wang, Jihui
    Meng, Yun
    Tian, Jingxian
    Hou, Biao
    Jiao, Licheng
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 184 : 1 - 18
  • [43] A FAST AND ACCURATE METHOD FOR REMOTE SENSING IMAGE-TEXT RETRIEVAL BASED ON LARGE MODEL KNOWLEDGE DISTILLATION
    Liao, Yu
    Yang, Rui
    Xie, Tao
    Xing, Hantong
    Quan, Dou
    Wang, Shuang
    Hou, Biao
    [J]. IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5077 - 5080
  • [44] AFNet: Adaptive Fusion Network for Remote Sensing Image Semantic Segmentation
    Liu, Rui
    Mi, Li
    Chen, Zhenzhong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (09): : 7871 - 7886
  • [45] Prior-Experience-Based Vision-Language Model for Remote Sensing Image-Text Retrieval
    Tang, Xu
    Huang, Dabiao
    Ma, Jingjing
    Zhang, Xiangrong
    Liu, Fang
    Jiao, Licheng
    [J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62
  • [46] Remote Sensing Image Semantic Segmentation Network Based on Multimodal Fusion
    Hu, Yuxiang
    Yu, Changhong
    Gao, Ming
    [J]. Computer Engineering and Applications, 60 (15): : 234 - 242
  • [47] Dual Semantic Relationship Attention Network for Image-Text Matching
    Wen, Keyu
    Gu, Xiaodong
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [48] Feature Interaction Based Graph Convolutional Networks for Image-Text Retrieval
    Hu, Yongli
    Gao, Feili
    Sun, Yanfeng
    Gao, Junbin
    Yin, Baocai
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT III, 2021, 12893 : 217 - 229
  • [49] Joint feature approach for image-text cross-modal retrieval
    Gao, Dihui
    Sheng, Lijie
    Xu, Xiaodong
    Miao, Qiguang
    [J]. Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2024, 51 (04): : 128 - 138
  • [50] HGAN: Hierarchical Graph Alignment Network for Image-Text Retrieval
    Guo, Jie
    Wang, Meiting
    Zhou, Yan
    Song, Bin
    Chi, Yuhao
    Fan, Wei
    Chang, Jianglong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 9189 - 9202