Exploring Data and Models in SAR Ship Image Captioning

被引:2
|
作者
Zhao, Kai [1 ]
Xiong, Wei [1 ]
机构
[1] Space Engn Univ, Sci & Technol Complex Elect Syst Simulat Lab, Beijing 101400, Peoples R China
关键词
Marine vehicles; Synthetic aperture radar; Radar polarimetry; Remote sensing; Seaports; Feature extraction; Decoding; Encoding; Recurrent neural networks; SAR image; image captioning; encoder-decoder; recurrent neural network; long short-term memory network;
D O I
10.1109/ACCESS.2022.3202193
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, considerable progress has been made in ship detection in synthetic aperture radar (SAR) images; however, no research has been conducted on translating SAR ship images into flexible and accurate sentences. To explore image captions in SAR ship images, we conduct the following work: first, to better describe SAR ship images, we propose certain principles for SAR image annotation based on the characteristics of SAR images. Second, to make better use of SAR ship images, a large-scale SAR ship image captioning dataset is carefully constructed. Finally, we explore encoder-decoder models and the attention mechanism and apply these methods to the SAR ship image captioning task. We conduct detailed experiments on the proposed dataset and find that the encoder-decoder model and attention mechanism can obtain good results in the SAR ship image captioning task. The experiments also reveal that the generated sentences can accurately describe SAR ship images. This dataset has already been published on https://github.com/5132210/SSIC.git.
引用
收藏
页码:91150 / 91159
页数:10
相关论文
共 50 条
  • [1] Exploring Semantic Relationships for Image Captioning without Parallel Data
    Liu, Fenglin
    Gao, Meng
    Zhang, Tianhao
    Zou, Yuexian
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 439 - 448
  • [2] Multimodal Data Augmentation for Image Captioning using Diffusion Models
    Xiao, Changrong
    Xu, Sean Xin
    Zhang, Kunpeng
    PROCEEDINGS OF THE 1ST WORKSHOP ON LARGE GENERATIVE MODELS MEET MULTIMODAL APPLICATIONS, LGM3A 2023, 2023, : 23 - 33
  • [3] Exploring Visual Relationship for Image Captioning
    Yao, Ting
    Pan, Yingwei
    Li, Yehao
    Mei, Tao
    COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 711 - 727
  • [4] Simulation of SAR image of ship
    Ji, K
    Kuang, G
    Su, Y
    Yu, WX
    IGARSS 2005: IEEE International Geoscience and Remote Sensing Symposium, Vols 1-8, Proceedings, 2005, : 1349 - 1352
  • [5] Exploring better image captioning with grid features
    Jie Yan
    Yuxiang Xie
    Yanming Guo
    Yingmei Wei
    Xidao Luan
    Complex & Intelligent Systems, 2024, 10 : 3541 - 3556
  • [6] Exploring better image captioning with grid features
    Yan, Jie
    Xie, Yuxiang
    Guo, Yanming
    Wei, Yingmei
    Luan, Xidao
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 3541 - 3556
  • [7] Image captioning by diffusion models: A survey
    Daneshfar, Fatemeh
    Bartani, Ako
    Lotfi, Pardis
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [8] THE SIMULATION OF THE SAR IMAGE OF A SHIP WAKE
    TUNALEY, JKE
    BULLER, EH
    WU, KH
    REY, MT
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 1991, 29 (01): : 149 - 156
  • [9] VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
    Chen, Jun
    Guo, Han
    Yi, Kai
    Li, Boyang
    Elhoseiny, Mohamed
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18009 - 18019
  • [10] Exploring Models and Data for Image Question Answering
    Ren, Mengye
    Kiros, Ryan
    Zemel, Richard S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28