A survey on deep neural network-based image captioning

Authors
Xiaoxiao Liu
Qingyang Xu
Ning Wang
Affiliations
[1] Shandong University, School of Mechanical, Electrical and Information Engineering
[2] Dalian Maritime University, Marine Engineering College
Source
The Visual Computer | 2019 / Vol. 35, No. 3
Keywords
Image captioning; Image understanding; Object detection; Language model; Attention mechanism; Dense captioning
Abstract
Image captioning is an active topic in image understanding. It is composed of two natural parts (“look” and “language expression”), which correspond to the two most important fields of artificial intelligence (“machine vision” and “natural language processing”). With the development of deep neural networks and better-labeled datasets, image captioning techniques have advanced quickly. This survey introduces image captioning approaches and improvements based on deep neural networks, including the characteristics of the specific techniques. The earliest deep neural network-based approach to image captioning is the retrieval-based method, which uses a search technique to find an appropriate image description. The template-based method separates the captioning process into object detection and sentence generation. More recently, end-to-end learning-based methods have been shown to be effective at image captioning and can generate more flexible and fluent sentences. The survey reviews these image captioning methods in detail and discusses some remaining challenges.
Pages: 445-470
Page count: 25