A survey on deep neural network-based image captioning

被引:0
|
作者
Xiaoxiao Liu
Qingyang Xu
Ning Wang
机构
[1] Shandong University,School of Mechanical, Electrical and Information Engineering
[2] Dalian Maritime University,Marine Engineering College
来源
The Visual Computer | 2019年 / 35卷
关键词
Image captioning; Image understanding; Object detection; Language model; Attention mechanism; Dense captioning;
D O I
暂无
中图分类号
学科分类号
摘要
Image captioning is a hot topic of image understanding, and it is composed of two natural parts (“look” and “language expression”) which correspond to the two most important fields of artificial intelligence (“machine vision” and “natural language processing”). With the development of deep neural networks and better labeling database, the image captioning techniques have developed quickly. In this survey, the image captioning approaches and improvements based on deep neural network are introduced, including the characteristics of the specific techniques. The early image captioning approach based on deep neural network is the retrieval-based method. The retrieval method makes use of a searching technique to find an appropriate image description. The template-based method separates the image captioning process into object detection and sentence generation. Recently, end-to-end learning-based image captioning method has been verified effective at image captioning. The end-to-end learning techniques can generate more flexible and fluent sentence. In this survey, the image captioning methods are reviewed in detail. Furthermore, some remaining challenges are discussed.
引用
收藏
页码:445 / 470
页数:25
相关论文
共 50 条
  • [31] Recurrent Neural Network for Content Based Image Retrieval Using Image Captioning Model
    Sindu, S.
    Kousalya, R.
    [J]. COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 1067 - 1077
  • [32] Deep Image Captioning Survey: A Resource Availability Perspective
    Al Sulaimi, Mousa
    Ahmad, Imtiaz
    Jeragh, Mohammad
    [J]. PROCEEDINGS OF THE 2021 29TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), VOL 1, 2021, : 3 - 13
  • [33] Deep neural network-based relation extraction: an overview
    Wang, Hailin
    Qin, Ke
    Zakari, Rufai Yusuf
    Lu, Guoming
    Yin, Jin
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (06): : 4781 - 4801
  • [34] DeepLoc: Deep Neural Network-based Telco Localization
    Zhang, Yige
    Xiao, Yu
    Zhao, Kai
    Rao, Weixiong
    [J]. PROCEEDINGS OF THE 16TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS'19), 2019, : 258 - 267
  • [35] Analytics of Deep Neural Network-Based Background Subtraction
    Minematsu, Tsubasa
    Shimada, Atsushi
    Uchiyama, Hideaki
    Taniguchi, Rin-ichiro
    [J]. JOURNAL OF IMAGING, 2018, 4 (06)
  • [36] DeepSL: Deep Neural Network-based Similarity Learning
    Tourad, Mohamedou Cheikh
    Abdelmounaim, Abdali
    Dhleima, Mohamed
    Telmoud, Cheikh Abdelkader Ahmed
    Lachgar, Mohamed
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (03) : 1394 - 1401
  • [37] Deep neural network-based relation extraction: an overview
    Hailin Wang
    Ke Qin
    Rufai Yusuf Zakari
    Guoming Lu
    Jin Yin
    [J]. Neural Computing and Applications, 2022, 34 : 4781 - 4801
  • [38] Analytic Deep Neural Network-Based Robot Control
    Nguyen, Huu-Thiet
    Cheah, Chien Chern
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (04) : 2176 - 2184
  • [39] Deep neural network-based underwater OFDM receiver
    Zhang, Jing
    Cao, Yu
    Han, Guangyao
    Fu, Xiaomei
    [J]. IET COMMUNICATIONS, 2019, 13 (13) : 1998 - 2002
  • [40] Deep neural network-based secure healthcare framework
    Aldaej, Abdulaziz
    Ahanger, Tariq Ahamed
    Ullah, Imdad
    [J]. Neural Computing and Applications, 2024, 36 (28) : 17467 - 17482