An Integrated Hybrid CNN-RNN Model for Visual Description and Generation of Captions

被引:18
|
作者
Khamparia, Aditya [1 ]
Pandey, Babita [2 ]
Tiwari, Shrasti [3 ]
Gupta, Deepak [4 ]
Khanna, Ashish [4 ]
Rodrigues, Joel J. P. C. [5 ,6 ]
机构
[1] Lovely Profess Univ, Sch Comp Sci & Engn, Phagwara, Punjab, India
[2] Babasaheb Bhimrao Ambedkar Univ, Dept Comp Sci & IT, Satellite Campus, Amethi, UP, India
[3] Lovely Profess Univ, Div Examinat, Phagwara, Punjab, India
[4] Maharaja Agrasen Inst Technol, Delhi, India
[5] Fed Univ Piaui UFPI, Teresina, PI, Brazil
[6] Inst Telecomunicacoes, Lisbon, Portugal
关键词
Captions; Long short-term memory; Convolutional neural network; Recurrent neural network; Feature vectors; Extraction;
D O I
10.1007/s00034-019-01306-8
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video captioning is currently considered to be one of the simplest ways to index and search data efficiently. In today's era, suitable captioning of video images can be facilitated with deep learning architectures. The focus of past research has been on providing image captions; however, the generation of high-quality captions with suitable semantics for different scenes has not yet been achieved. Therefore, this work aims to generate well-defined and meaningful captions to images and videos by using convolutional neural networks (CNN) and recurrent neural networks in combination. Beginning with the available dataset, features of images and videos were extracted using CNN. The extracted feature vectors were then utilized to generate a language model with the involvement of long short-term memory for individual word grams. The generated meaningful captions were trained using a softmax function, for performance computation using some predefined evaluation metrics. The obtained experimental results demonstrate that the proposed model outperforms existing benchmark models.
引用
收藏
页码:776 / 788
页数:13
相关论文
共 50 条
  • [31] A Hybrid RNN-CNN Encoder for Neural Conversation Model
    Ma, Zhiyuan
    Rong, Wenge
    Wang, Yanmeng
    Shi, Libin
    Xiong, Zhang
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2018, PT II, 2018, 11062 : 159 - 170
  • [32] Drug review sentimental analysis based on modular lexicon generation and a fusion of bidirectional threshold weighted mapping CNN-RNN
    Dubey, Gaurav
    Singh, Harivans Pratap
    Sheoran, Kavita
    Dhand, Geetika
    Malik, Pooja
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (03):
  • [33] Effective offline handwritten text recognition model based on a sequence-to-sequence approach with CNN-RNN networks
    Geetha, R.
    Thilagam, T.
    Padmavathy, T.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (17): : 10923 - 10934
  • [34] An Attention Mechanism Oriented Hybrid CNN-RNN Deep Learning Architecture of Container Terminal Liner Handling Conditions Prediction
    Li, Bin
    He, Yuqing
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [35] Artificial Humming Bird Optimization-Based Hybrid CNN-RNN for Accurate Exudate Classification from Fundus Images
    Dhiravidachelvi, E.
    Pandi, Senthil S.
    Prabavathi, R.
    Subramanian, Bala C.
    [J]. JOURNAL OF DIGITAL IMAGING, 2023, 36 (01) : 59 - 72
  • [36] COVID-19 Detection using Hybrid CNN-RNN Architecture with Transfer Learning from X-Rays
    Deshwal D.
    Sangwan P.
    Dahiya N.
    Lilhore U.K.
    Dalal S.
    Simaiya S.
    [J]. Current Medical Imaging, 2024, 20
  • [37] MRI-Based Kinetic Heterogeneity Evaluation in the Accurate Access of Axillary Lymph Node Status in Breast Cancer Using a Hybrid CNN-RNN Model
    Guo, Yi-Jun
    Yin, Rui
    Zhang, Qian
    Han, Jun-Qi
    Dou, Zhao-Xiang
    Wang, Peng-Bo
    Lu, Hong
    Liu, Pei-Fang
    Chen, Jing-Jing
    Ma, Wen-Juan
    [J]. JOURNAL OF MAGNETIC RESONANCE IMAGING, 2024,
  • [38] Editorial for "MRI-Based Kinetic Heterogeneity Evaluation in the Accurate Access of Axillary Lymph Node Status in Breast Cancer Using a Hybrid CNN-RNN Model"
    Grovik, Endre
    [J]. JOURNAL OF MAGNETIC RESONANCE IMAGING, 2024,
  • [39] A combined short-term wind speed forecasting model based on CNN-RNN and linear regression optimization considering error
    Duan, Jikai
    Chang, Mingheng
    Chen, Xiangyue
    Wang, Wenpeng
    Zuo, Hongchao
    Bai, Yulong
    Chen, Bolong
    [J]. RENEWABLE ENERGY, 2022, 200 : 788 - 808
  • [40] CNUSVM: Hybrid CNN-Uneven SVM Model for Imbalanced Visual Learning
    Geng, Mengyue
    Wang, Yaowei
    Tian, Yonghong
    Huang, Tiejun
    [J]. 2016 IEEE SECOND INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2016, : 186 - 193