Toward Remote Sensing Image Retrieval Under a Deep Image Captioning Perspective

Cited by: 48
Authors
Hoxha, Genc [1]
Melgani, Farid [1]
Demir, Begum [2]
Affiliations
[1] Univ Trento, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
[2] Tech Univ Berlin, Fac Elect Engn & Comp Sci, D-10623 Berlin, Germany
Funding
European Research Council
Keywords
Visualization; Image retrieval; Feature extraction; Semantics; Integrated circuits; Recurrent neural networks; Remote sensing; Convolutional neural network; deep learning; image captioning; image retrieval; recurrent neural network; remote sensing; semantic gap; GRAPH; MODELS;
DOI
10.1109/JSTARS.2020.3013818
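Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract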
The performance of remote sensing image retrieval (RSIR) systems depends on how well the extracted features characterize the semantic content of images. Existing RSIR systems describe images by visual descriptors that model the primitives (such as different land-cover classes) present in the images. However, visual descriptors may not be sufficient to describe the high-level, complex content of RS images (e.g., attributes and relationships among different land-cover classes). To address this issue, in this article we present an RSIR system that generates and exploits textual descriptions (captions, i.e., sentences) to accurately describe the objects present in RS images, their attributes, and the relationships between them. The proposed retrieval system consists of three main steps. The first step encodes the visual features of the image and then translates the encoded features into a textual description that summarizes the image content with captions. This is achieved by combining a convolutional neural network with a recurrent neural network. The second step converts the generated textual descriptions into semantically meaningful feature vectors using recent word embedding techniques. Finally, the last step estimates the similarity between the vectors of the textual descriptions of the query image and those of the archive images, and then retrieves the images most similar to the query image. Experimental results obtained on two different datasets show that describing image content with captions in the RSIR framework leads to accurate retrieval performance.
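The second and third steps described in the abstract (caption to feature vector, then similarity ranking) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the article: the captions are written by hand rather than produced by a CNN-RNN captioner, `TOY_VECTORS` is a made-up three-dimensional embedding table standing in for a trained word embedding model, captions are embedded by averaging word vectors, and cosine similarity is used as the similarity measure. The names `embed_caption`, `cosine`, `retrieve`, and `ARCHIVE` are hypothetical.

```python
import math

# Hypothetical 3-d word vectors standing in for a trained embedding model.
TOY_VECTORS = {
    "airplane": [1.0, 0.0, 0.0],
    "airport": [0.9, 0.1, 0.0],
    "runway": [0.8, 0.2, 0.0],
    "river": [0.0, 1.0, 0.0],
    "bridge": [0.1, 0.9, 0.1],
    "forest": [0.0, 0.1, 1.0],
    "trees": [0.0, 0.2, 0.9],
}


def embed_caption(caption):
    """Average the vectors of known words; words without a vector are skipped."""
    vectors = [TOY_VECTORS[w] for w in caption.lower().split() if w in TOY_VECTORS]
    if not vectors:
        return [0.0, 0.0, 0.0]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return sum(a * b for a, b in zip(u, v)) / (norm_u * norm_v)


def retrieve(query_caption, archive_captions, top_k=2):
    """Rank archive images by caption similarity to the query caption."""
    query_vector = embed_caption(query_caption)
    scored = [
        (cosine(query_vector, embed_caption(caption)), name)
        for name, caption in archive_captions.items()
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [name for _, name in scored[:top_k]]


# Hand-written captions standing in for the output of the captioning step.
ARCHIVE = {
    "img_001": "an airplane on a runway at the airport",
    "img_002": "a bridge over a river",
    "img_003": "a forest with dense trees",
}

ranked = retrieve("an airplane near the airport", ARCHIVE, top_k=2)
print(ranked)
```

In this toy setup the airport image ranks first because its caption shares the most semantically close words with the query caption. A real system would replace the hand-made table with a trained embedding model and the hand-written captions with generated ones.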
Pages: 4462-4475
Page count: 14