A novel approach for image retrieval in remote sensing using vision-language-based image caption generation

被引:0
|
作者
Prem Shanker Yadav [1 ]
Dinesh Kumar Tyagi [1 ]
Santosh Kumar Vipparthi [2 ]
机构
[1] Malaviya National Institute of Technology,Department of Computer Science and Engineering
[2] Indian Institute of Technology,School of Artificial Intelligence and Data Engineering
关键词
Image caption generation; Image retrieval; Remote sensing big data; Vision language pre-training model; TF-IDF;
D O I
10.1007/s11042-024-20447-w
中图分类号
学科分类号
摘要
Recent advancements in satellite technologies have resulted in the emergence of Remote Sensing (RS) images. Hence, the primary imperative research domain is designing a precise retrieval model for retrieving the most pertinent images based on the query. Present Remote Sensing Image Retrieval (RSIR) systems use visual descriptors to characterize the primitives (such as various land-cover types) that are visible in the images. However, the visual descriptors are inadequate for defining the complicated content of RS images. To solve this problem, a new model is devised for image retrieval based on image captions. The goal is to generate textual illustrations with captions to define relations amongst objects precisely. Here, image captioning is attained based on the vision-language pre-training model. The image captions are utilized for generating features like term frequency-inverse document frequency (TF-IDF), length of text, and Bag of Words. Meanwhile, query text is utilized wherein features like TF-IDF, text length, and Bag of Words are obtained. The similarity between query text features and the image captions features has been computed on the basis of a hybrid similarity measure wherein weights are tuned with the proposed Honey Badger Political Optimizer (HBPO) to retrieve the image. The proposed HBPO provided enhanced efficiency with elevated precision of 93.3%, recall of 93.7%, F1-score of 93.5%, and Recall-Oriented Understudy for Gisting Evaluation (ROUGE) of 0.441.
引用
收藏
页码:2985 / 3014
页数:29
相关论文
共 50 条
  • [31] Remote sensing image retrieval using morphological texture descriptors
    Aptoula, Erchan
    Korkmaz, Semih
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [32] Image Caption Generation Using Attention Model
    Ramalakshmi, Eliganti
    Jain, Moksh Sailesh
    Uddin, Mohammed Ameer
    INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, ICIDCA 2021, 2022, 96 : 1009 - 1017
  • [33] Remote Sensing Image Retrieval Based on Regional Attention Mechanism
    Peng Yanfei
    Mei Jinye
    Wang Kaixin
    Zi Lingling
    Sang Yu
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (10)
  • [34] The Remote Sensing Image Retrieval Based on Multi-feature
    Duan Jian-bo
    Ma Cai-hong
    Liu Shi-Bin
    Zhang Jing
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XIX, 2013, 8892
  • [35] A Remote Sensing Image Retrieval Method Based on Quaternion Transformation
    Xu Y.
    Zhao X.
    Li Z.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2019, 44 (11): : 1633 - 1640
  • [36] A remote sensing image retrieval model based on semantic mining
    Liu, Tingting
    Li, Pingxiang
    Zhang, Liangpei
    Chen, Xu
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/ Geomatics and Information Science of Wuhan University, 2009, 34 (06): : 684 - 687
  • [37] Study on content-based remote sensing image retrieval
    Du, PJ
    Chen, YH
    Tang, H
    Fang, T
    IGARSS 2005: IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-8, PROCEEDINGS, 2005, : 707 - 710
  • [38] Topic-Based Image Caption Generation
    Dash, Sandeep Kumar
    Acharya, Shantanu
    Pakray, Partha
    Das, Ranjita
    Gelbukh, Alexander
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2020, 45 (04) : 3025 - 3034
  • [39] Attention-Based Image Caption Generation
    Manasa, M.
    Sowmya, D.
    Reddy, Y. Supriya
    Sreedevi, Pogula
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, MACHINE LEARNING AND APPLICATIONS, VOL 1, ICDSMLA 2023, 2025, 1273 : 364 - 369
  • [40] Topic-Based Image Caption Generation
    Sandeep Kumar Dash
    Shantanu Acharya
    Partha Pakray
    Ranjita Das
    Alexander Gelbukh
    Arabian Journal for Science and Engineering, 2020, 45 : 3025 - 3034