A novel approach for image retrieval in remote sensing using vision-language-based image caption generation

被引:0
|
作者
Prem Shanker Yadav [1 ]
Dinesh Kumar Tyagi [1 ]
Santosh Kumar Vipparthi [2 ]
机构
[1] Malaviya National Institute of Technology,Department of Computer Science and Engineering
[2] Indian Institute of Technology,School of Artificial Intelligence and Data Engineering
关键词
Image caption generation; Image retrieval; Remote sensing big data; Vision language pre-training model; TF-IDF;
D O I
10.1007/s11042-024-20447-w
中图分类号
学科分类号
摘要
Recent advancements in satellite technologies have resulted in the emergence of Remote Sensing (RS) images. Hence, the primary imperative research domain is designing a precise retrieval model for retrieving the most pertinent images based on the query. Present Remote Sensing Image Retrieval (RSIR) systems use visual descriptors to characterize the primitives (such as various land-cover types) that are visible in the images. However, the visual descriptors are inadequate for defining the complicated content of RS images. To solve this problem, a new model is devised for image retrieval based on image captions. The goal is to generate textual illustrations with captions to define relations amongst objects precisely. Here, image captioning is attained based on the vision-language pre-training model. The image captions are utilized for generating features like term frequency-inverse document frequency (TF-IDF), length of text, and Bag of Words. Meanwhile, query text is utilized wherein features like TF-IDF, text length, and Bag of Words are obtained. The similarity between query text features and the image captions features has been computed on the basis of a hybrid similarity measure wherein weights are tuned with the proposed Honey Badger Political Optimizer (HBPO) to retrieve the image. The proposed HBPO provided enhanced efficiency with elevated precision of 93.3%, recall of 93.7%, F1-score of 93.5%, and Recall-Oriented Understudy for Gisting Evaluation (ROUGE) of 0.441.
引用
收藏
页码:2985 / 3014
页数:29
相关论文
共 50 条
  • [41] Region-based retrieval of remote sensing image patches with adaptive image segmentation
    Li, Shijin
    Zhu, Jiali
    Zhu, Yuelong
    Feng, Jun
    OPTICAL ENGINEERING, 2012, 51 (06)
  • [42] Vehicle Detection in Remote Sensing Image Based on Machine Vision
    Zhou, Liming
    Zheng, Chang
    Yan, Haoxin
    Zuo, Xianyu
    Qiao, Baojun
    Zhou, Bing
    Fan, Minghu
    Liu, Yang
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [43] A Natural Language User Demand Semantic Model for Remote Sensing Image Retrieval
    Zhang, Xia
    Chen, Liuyuan
    Zhu, Xinyan
    INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS, PTS 1-4, 2013, 241-244 : 2897 - +
  • [44] A Novel Approach for Content Based Image Retrieval
    Singh, Nidhi
    Singh, Kanchan
    Sinha, Ashok K.
    2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, CONTROL AND INFORMATION TECHNOLOGY (C3IT-2012), 2012, 4 : 245 - 250
  • [45] TypeFormer: Multiscale Transformer With Type Controller for Remote Sensing Image Caption
    Chen, Zihang
    Wang, Junjue
    Ma, Ailong
    Zhong, Yanfei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [46] Toward Multilabel Image Retrieval for Remote Sensing
    Imbriaco, Raffaele
    Sebastian, Clint
    Bondarev, Egor
    de With, Peter H. N.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [47] DFEN: Dual Feature Enhancement Network for Remote Sensing Image Caption
    Zhao, Weihua
    Yang, Wenzhong
    Chen, Danny
    Wei, Fuyuan
    ELECTRONICS, 2023, 12 (07)
  • [48] Remote sensing image retrieval using object-based, semantic classifier techniques
    Kumar N.S.
    Arun M.
    Dangi M.K.
    International Journal of Information and Communication Technology, 2018, 13 (01) : 68 - 82
  • [49] A novel approach for change detection in remote sensing image based on saliency map
    Tian, Minghui
    Wan, Shouhong
    Yue, Lihua
    COMPUTER GRAPHICS, IMAGING AND VISUALISATION: NEW ADVANCES, 2007, : 397 - +
  • [50] A Novel Approach to Image Retrieval for Vision-Based Positioning Utilizing Graph Topology
    Elashry, Abdelgwad
    Toth, Charles
    ISPRS ANNALS OF THE PHOTOGRAMMETRY, REMOTE SENSING AND SPATIAL INFORMATION SCIENCES: VOLUME X-2-2024, 2024, : 49 - 56