Automatic image captioning combining natural language processing and deep neural networks

Cited by: 8
Authors
Rinaldi, Antonio M. [1]
Russo, Cristiano [1]
Tommasino, Cristian [1]
Affiliation
[1] Univ Naples Federico II, Dept Elect Engn & Informat Technol, IKNOS LAB Intelligent & Knowledge Syst LUPT, Via Claudio 21, I-80125 Naples, Italy
Keywords
Object detection; Image captioning; Deep neural networks; Semantic-instance segmentation
DOI
10.1016/j.rineng.2023.101107
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
An image contains a great deal of information that humans can detect in a very short time. Image captioning aims to capture this information by describing the image content through image and text processing techniques. A distinctive feature of the proposed approach is the combination of multiple networks to capture as many semantically distinct features as possible. In this work, our goal is to show that a strategy combining existing methods can improve performance on object detection tasks compared with the performance achieved by each method tested individually. The approach uses different deep neural networks that perform two levels of hierarchical object detection in an image. The results are combined and used by a captioning module that generates image captions through natural language processing techniques. Several experimental results are reported and discussed to show the effectiveness of our framework. The combination strategy shows a gain in precision over the single models.
Pages: 14