A novel automatic image caption generation using bidirectional long-short term memory framework

被引:0
|
作者
Zhongfu Ye
Rashid Khan
Nuzhat Naqvi
M. Shujah Islam
机构
[1] University of Science and Technology of China,
来源
关键词
Image captioning; inception v3; B-LSTM; P-MFO optimization; Bleu score;
D O I
暂无
中图分类号
学科分类号
摘要
Image Captioning, the process of generating a textual description of an image, has emerged as a hot research due to its practical importance in many domains. It is a challenging task as it uses both Natural Language Processing and Computer Vision related fields to generate the captions. Despite the fact that the literature has reported notable image captioning methodologies, they still lag in accomplishing the substantial performance level for diverse datasets. This paper proposes an image caption generating mechanism based on Optimized Bidirectional Long Short-Term Memory (B-LSTM) model. We propose a variant of Moth Flame Optimization (PMFO), termed here as Proposed Moth Flame Optimization (PMFO), which has logarithmic spiral update based on correlation. The performance of the proposed model is demonstrated on benchmark datasets like Flicker 8 k, Flicker30k, VizWik and COCO datasets using renowned metrics such as CIDEr, BLEU, SPICE and ROUGH. The performance analysis proves that the B-LSTM achieves better performance on caption generation than state-of-the-art methods.
引用
收藏
页码:25557 / 25582
页数:25
相关论文
共 50 条
  • [31] Load Demand Forecasting Using a Long-Short Term Memory Neural Network
    Ortega, Arturo
    Borunda, Monica
    Conde, Luis
    Garcia-Beltran, Carlos
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2023, PT I, 2024, 14391 : 121 - 137
  • [32] Zero Shot Intent Classification Using Long-Short Term Memory Networks
    Williams, Kyle
    [J]. INTERSPEECH 2019, 2019, : 844 - 848
  • [33] Photonic Long-Short Term Memory Neural Networks with Analog Memory
    Howard, Emma R.
    Marquez, Bicky A.
    Shastri, Bhavin J.
    [J]. 2020 IEEE PHOTONICS CONFERENCE (IPC), 2020,
  • [34] Automatic defect detection and three-dimensional reconstruction from pulsed thermography images based on a bidirectional long-short term memory network
    Wu, Zhuoqiao
    Chen, Siyun
    Feng, Fan
    Qi, Jinrong
    Feng, Lichun
    Tao, Ning
    Zhang, Cunlin
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 124
  • [35] Surface roughness prediction in milling using long-short term memory modelling
    Manjunath, K.
    Tewary, Suman
    Khatri, Neha
    [J]. MATERIALS TODAY-PROCEEDINGS, 2022, 64 : 1300 - 1304
  • [36] Individualized Location Prediction Using Autoencoders and Long-Short Term Memory Networks
    Onwujekwe, Gerald
    Men, Zibo
    Duke, Joseph
    [J]. 2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,
  • [37] Automatic Image Caption Generation Using ResNet & Torch Vision
    Verma, Vijeta
    Saritha, Sri Khetwat
    Jain, Sweta
    [J]. MACHINE LEARNING, IMAGE PROCESSING, NETWORK SECURITY AND DATA SCIENCES, MIND 2022, PT II, 2022, 1763 : 82 - 101
  • [38] Automatic Classification of Normal-Abnormal Heart Sounds Using Convolution Neural Network and Long-Short Term Memory
    Chen, Ding
    Xuan, Weipeng
    Gu, Yexing
    Liu, Fuhai
    Chen, Jinkai
    Xia, Shudong
    Jin, Hao
    Dong, Shurong
    Luo, Jikui
    [J]. ELECTRONICS, 2022, 11 (08)
  • [39] A Novel CNN, Bidirectional Long-Short Term Memory, and Gated Recurrent Unit-Based Hybrid Approach for Human Activity Recognition
    Thakur, Narina
    Singh, Sunil K.
    Gupta, Akash
    Jain, Kunal
    Jain, Rachna
    Perakovic, Dragan
    Nedjah, Nadia
    Rafsanjani, Marjan Kuchaki
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2022, 14 (01):
  • [40] Long Short-Term Memory Networks for Automatic Generation of Conversations
    Fujita, Tomohiro
    Bai, Wenjun
    Quan, Changqin
    [J]. 2017 18TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNDP 2017), 2017, : 483 - 487