Exploring Data and Models in SAR Ship Image Captioning

Cited by: 2
Authors
Zhao, Kai [1]
Xiong, Wei [1]
Affiliations
[1] Space Engn Univ, Sci & Technol Complex Elect Syst Simulat Lab, Beijing 101400, Peoples R China
Keywords
Marine vehicles; Synthetic aperture radar; Radar polarimetry; Remote sensing; Seaports; Feature extraction; Decoding; Encoding; Recurrent neural networks; SAR image; image captioning; encoder-decoder; recurrent neural network; long short-term memory network
DOI
10.1109/ACCESS.2022.3202193
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, considerable progress has been made in ship detection in synthetic aperture radar (SAR) images; however, no research has been conducted on translating SAR ship images into flexible and accurate sentences. To explore image captioning for SAR ship images, we carry out the following work. First, to better describe SAR ship images, we propose annotation principles based on the characteristics of SAR images. Second, to make better use of SAR ship images, we carefully construct a large-scale SAR ship image captioning dataset. Finally, we explore encoder-decoder models and the attention mechanism and apply them to the SAR ship image captioning task. Detailed experiments on the proposed dataset show that the encoder-decoder model and the attention mechanism obtain good results on this task and that the generated sentences accurately describe SAR ship images. The dataset is available at https://github.com/5132210/SSIC.git.
Pages: 91150 - 91159
Page count: 10
Related papers
50 in total
  • [31] Efficient Image Captioning Based on Vision Transformer Models
    Elbedwehy, Samar
    Medhat, T.
    Hamza, Taher
    Alrahmawy, Mohammed F.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01) : 1483 - 1500
  • [32] Exploring coherence from heterogeneous representations for OCR image captioning
    Zhang, Yao
    Song, Zijie
    Hu, Zhenzhen
    MULTIMEDIA SYSTEMS, 2024, 30 (05)
  • [33] A Survey on Attention-Based Models for Image Captioning
    Osman, Asmaa A. E.
    Shalaby, Mohamed A. Wahby
    Soliman, Mona M.
    Elsayed, Khaled M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (02) : 403 - 412
  • [34] Paying Attention to Descriptions Generated by Image Captioning Models
    Tavakoli, Hamed R.
    Shetty, Rakshith
    Borji, Ali
    Laaksonen, Jorma
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017 : 2506 - 2515
  • [35] Towards Multilingual Image Captioning Models that Can Read
    Gallardo Garcia, Rafael
    Beltran Martinez, Beatriz
    Hernandez Gracidas, Carlos
    Vilarino Ayala, Darnes
    ADVANCES IN SOFT COMPUTING (MICAI 2021), PT II, 2021, 13068 : 13 - 27
  • [36] Language Models for Image Captioning: The Quirks and What Works
    Devlin, Jacob
    Cheng, Hao
    Fang, Hao
    Gupta, Saurabh
    Deng, Li
    He, Xiaodong
    Zweig, Geoffrey
    Mitchell, Margaret
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015 : 100 - 105
  • [37] A Super Lightweight and Efficient SAR Image Ship Detector
    Yang, Yingguang
    Ju, Yanwei
    Zhou, Ziyan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [38] Finsler Metric Method for Ship Detection in SAR Image
    Zhao, Huafei
    Yang, Meng
    PROGRESS IN ELECTROMAGNETICS RESEARCH LETTERS, 2022, 105 : 63 - 69
  • [39] Ship Detection of SAR Image in Complex Nearshore Environment
    Yuan, Jinquan
    Zhang, Zhixin
    Zhang, Peng
    2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022 : 317 - 321
  • [40] SAR IMAGE SHIP DETECTION BASED ON SCENE INTERPRETATION
    Hou, Shilong
    Ma, Xiaorui
    Wang, Xinrong
    Fu, Zanhao
    Wang, Jie
    Wang, Hongyu
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020 : 2863 - 2866