Automatic image captioning combining natural language processing and deep neural networks

被引:8
|
作者
Rinaldi, Antonio M. [1 ]
Russo, Cristiano [1 ]
Tommasino, Cristian [1 ]
机构
[1] Univ Naples Federico II, Dept Elect Engn & Informat Technol, IKNOS LAB Intelligent & Knowledge Syst LUPT, Via Claudio 21, I-80125 Naples, Italy
关键词
Object detection; Image captioning; Deep neural networks; Semantic-instance segmentation;
D O I
10.1016/j.rineng.2023.101107
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
An image contains a lot of information that humans can detect in a very short time. Image captioning aims to detect this information by describing the image content through image and text processing techniques. One of the peculiarities of the proposed approach is the combination of multiple networks to catch as many distinct features as possible from a semantic point of view. In this work, our goal is to prove that a combination strategy of existing methods can efficiently improve the performance in the object detection tasks concerning the performance achieved by each tested individually. This approach involves using different deep neural networks that perform two levels of hierarchical object detection in an image. The results are combined and used by a captioning module that generates image captions through natural language processing techniques. Several experimental results are reported and discussed to show the effectiveness of our framework. The combination strategy has also improved, showing a gain in precision over single models.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Automatic Design of Deep Neural Networks Applied to Image Segmentation Problems
    Lima, Ricardo
    Pozo, Aurora
    Mendiburu, Alexander
    Santana, Roberto
    GENETIC PROGRAMMING, EUROGP 2021, 2021, 12691 : 98 - 113
  • [42] A survey on deep neural network-based image captioning
    Xiaoxiao Liu
    Qingyang Xu
    Ning Wang
    The Visual Computer, 2019, 35 : 445 - 470
  • [43] A survey on deep neural network-based image captioning
    Liu, Xiaoxiao
    Xu, Qingyang
    Wang, Ning
    VISUAL COMPUTER, 2019, 35 (03): : 445 - 470
  • [44] Networks and Natural Language Processing
    Radev, Dragomir R.
    Mihalcea, Rada
    AI MAGAZINE, 2008, 29 (03) : 16 - 28
  • [45] Automatic Bangla Image Captioning Based on Transformer Model in Deep Learning
    Hossain, Md Anwar
    Hasan, Mirza A. F. M. Rashidul
    Hossen, Ebrahim
    Asraful, Md
    Faruk, Md Omar
    Abadin, A. F. M. Zainul
    Ali, Md Suhag
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 1110 - 1117
  • [46] Artificial Neural Networks Applied to Natural Language Processing in Academic Texts
    Yail Marquez, Bogart
    Alanis, Arnulfo
    Magdaleno-Palencia, Jose Sergio
    Quezada, Angeles
    ADVANCED RESEARCH IN TECHNOLOGIES, INFORMATION, INNOVATION AND SUSTAINABILITY, ARTIIS 2022, PT I, 2022, 1675 : 535 - 545
  • [47] Recurrent Neural Networks with Mixed Hierarchical Structures for Natural Language Processing
    Luo, Zhaoxin
    Zhu, Michael
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [48] Psycho-analysis Using Natural Language Processing and Neural Networks
    Goyal, Agam
    Kacker, Rashi
    Maringanti, Hima Bindu
    ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 3, 2013, 178 : 185 - 193
  • [49] An Evaluation of Progressive Neural Networks for Transfer Learning in Natural Language Processing
    Hagerer, Gerhard
    Moeed, Abdul
    Dugar, Sumit
    Gupta, Sarthak
    Ghosh, Mainak
    Danner, Hannah
    Mitevski, Oliver
    Nawroth, Andreas
    Groh, Georg
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 1376 - 1381
  • [50] Deep Neural Networks in Natural Language Processing for Classifying Requirements by Origin and Functionality: An Application of BERT in System Requirements
    Mullis, Jesse
    Chen, Cheng
    Morkos, Beshoy
    Ferguson, Scott
    JOURNAL OF MECHANICAL DESIGN, 2024, 146 (04)