An Information Multiplexed Encoder-Decoder Network for Image Captioning in Hindi

Cited by: 3
Authors
Mishra, Santosh Kumar [1]
Peethala, Mahesh Babu [1]
Saha, Sriparna [1]
Bhattacharyya, Pushpak [2]
Affiliations
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna, Bihar, India
[2] Indian Inst Technol, Mumbai, Maharashtra, India
DOI
10.1109/SMC52423.2021.9658859
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Image captioning is a multi-modal problem that links computer vision and natural language processing, combining the challenges of image analysis and text generation. In the literature, most image captioning work has been carried out for the English language only. This paper proposes a new approach for image captioning in the Hindi language using a deep learning-based encoder-decoder architecture. Hindi, widely spoken in India and South Asia, is the fourth most spoken language globally; it is India's official language. In recent years, significant advances have been made in image captioning using encoder-decoder architectures based on convolutional neural networks (CNNs) and recurrent neural networks (RNNs). The encoder CNN extracts features from input images, whereas the decoder RNN performs language modeling. The proposed encoder-decoder architecture uses information multiplexing in the encoder CNN to achieve a performance gain in feature extraction. Extensive experimentation is carried out on the benchmark MSCOCO Hindi dataset, and significant improvements in BLEU score are reported compared to the baselines. Manual human evaluation in terms of adequacy and fluency of the generated captions further establishes the proposed method's efficacy in generating good-quality captions.
Pages: 3019-3024
Page count: 6
Related papers
50 in total
  • [31] A two-branch encoder-decoder network for image tampering localization
    Luo, Yuling
    Liang, Ce
    Qin, Sheng
    Liu, Junxiu
    Fu, Qiang
    Yang, Su
    APPLIED SOFT COMPUTING, 2024, 164
  • [32] Robust Image Watermarking Framework Powered by Convolutional Encoder-Decoder Network
    Thien Huynh-The
    Hua, Cam-Hao
    Nguyen Anh Tu
    Kim, Dong-Seong
    2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 552 - 558
  • [33] OverSegNet: A convolutional encoder-decoder network for image over-segmentation
    Li, Peng
    Ma, Wei
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 107
  • [34] Encoder-decoder Network with Self-attention Module for Image Restoration
    Jin, Qing
    Yu, Qi
    Liu, Jiying
    Tan, Xintong
    THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [35] RedCap: residual encoder-decoder capsule network for holographic image reconstruction
    Zeng, Tianjiao
    So, Hayden K-H
    Lam, Edmund Y.
    OPTICS EXPRESS, 2020, 28 (04) : 4876 - 4887
  • [36] Encoder-Decoder Model for Automatic Video Captioning Using Yolo Algorithm
    Alkalouti, Hanan Nasser
    Al Masre, Mayada Ahmed
    2021 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS), 2021, : 718 - 721
  • [37] ATTENTION-BASED ENCODER-DECODER NETWORK FOR SINGLE IMAGE DEHAZING
    Gao, Shunan
    Zhu, Jinghua
    Xi, Heran
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [38] NucleiNet: A Convolutional Encoder-decoder Network for Bio-image Denoising
    Liu, Zichuan
    Hu, Yifei
    Xu, Hang
    Nasser, Lamees
    Coquet, Philippe
    Boudier, Thomas
    Yu, Hao
    2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017, : 1986 - 1989
  • [39] VisCode: Embedding Information in Visualization Images using Encoder-Decoder Network
    Zhang, Peiying
    Li, Chenhui
    Wang, Changbo
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) : 326 - 336
  • [40] Encoder-decoder network with RMP for tongue segmentation
    Kusakunniran, Worapan
    Borwarnginn, Punyanuch
    Karnjanapreechakorn, Sarattha
    Thongkanchorn, Kittikhun
    Ritthipravat, Panrasee
    Tuakta, Pimchanok
    Benjapornlert, Paitoon
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (05) : 1193 - 1207