A Systematic Survey of Remote Sensing Image Captioning

被引:14
|
作者
Zhao, Beigeng [1 ]
机构
[1] Criminal Invest Police Univ China, Coll Publ Secur Informat Technol & Intelligence, Shenyang 110035, Peoples R China
关键词
Remote sensing; Feature extraction; Systematics; Task analysis; Deep learning; Measurement; Training; Image captioning; remote sensing; deep learning; natural language processing; OBJECT DETECTION; MODELS;
D O I
10.1109/ACCESS.2021.3128140
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image captioning is a cross-disciplinary task to automatically generate textural descriptions for a given image using computer vision and natural language processing techniques. Remote sensing image captioning refers to the application of this task to remote sensing images taken from high altitude by satellites, aircraft or drones. This interesting and valuable topic has only emerged in recent years and attracted considerable research attention. There has been extensive related work in the literature, with considerable results and an independent body of research, and various issues must be addressed in future work. However, to the best of our knowledge, there has been no review study in this area that can provide researchers with systematic reference information, which is the motivation of this study. To achieve this goal, 30 relevant articles were conditionally filtered and obtained for the review study. We analyzed and summarized the existing work from various perspectives, including technical solutions, data, evaluation metrics, and the experimental results of state-of-the-art methods. Based on this summary, the trends, pros and cons of the existing studies, issues to be addressed and valuable research directions in future work are discussed. The results of this paper can provide valuable reference information for researchers in related fields.
引用
收藏
页码:154086 / 154111
页数:26
相关论文
共 50 条
  • [1] Region Driven Remote Sensing Image Captioning
    Kumar, S. Chandeesh
    Hemalatha, M.
    Narayan, S. Badri
    Nandhini, P.
    [J]. 2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING ICRTAC -DISRUP - TIV INNOVATION , 2019, 2019, 165 : 32 - 40
  • [2] WordSentence Framework for Remote Sensing Image Captioning
    Wang, Qi
    Huang, Wei
    Zhang, Xueting
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (12): : 10532 - 10543
  • [3] Meta captioning: A meta learning based remote sensing image captioning framework
    Yang, Qiaoqiao
    Ni, Zihao
    Ren, Peng
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 186 : 190 - 200
  • [4] Incorporating object counts into remote sensing image captioning
    Ni, Zihao
    Zong, Zhaoyun
    Ren, Peng
    [J]. INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [5] Intensive Positioning Network for Remote Sensing Image Captioning
    Wang, Shengsheng
    Chen, Jiawei
    Wang, Guangyao
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 567 - 576
  • [6] Multiscale Multiinteraction Network for Remote Sensing Image Captioning
    Wang, Yong
    Zhang, Wenkai
    Zhang, Zhengyuan
    Gao, Xin
    Sun, Xian
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 2154 - 2165
  • [7] Structural Representative Network for Remote Sensing Image Captioning
    Sharma, Jaya
    Divya, Peketi
    Sravani, Yenduri
    Shekar, B. H.
    Mohan, Krishna C.
    [J]. FIFTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION, ICMV 2022, 2023, 12701
  • [8] Exploring region features in remote sensing image captioning
    Zhao, Kai
    Xiong, Wei
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 127
  • [9] Cooperative Connection Transformer for Remote Sensing Image Captioning
    Zhao, Kai
    Xiong, Wei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14
  • [10] GLCM: Global-Local Captioning Model for Remote Sensing Image Captioning
    Wang, Qi
    Huang, Wei
    Zhang, Xueting
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (11) : 6910 - 6922