Generative image captioning in Urdu using deep learning

被引:3
|
作者
Afzal M.K. [1 ]
Shardlow M. [2 ]
Tuarob S. [3 ]
Zaman F. [1 ]
Sarwar R. [2 ]
Ali M. [1 ]
Aljohani N.R. [5 ]
Lytras M.D. [5 ]
Nawaz R. [4 ]
Hassan S.-U. [2 ]
机构
[1] Information Technology University, Lahore
[2] Manchester Metropolitan University, Manchester
[3] Mahidol University, Nakhon Pathom
[4] Stafforshire University, Stoke-on-Trent
[5] King Abdulaziz University, Jeddah
关键词
Deeplearning; Image captioning; Information retrieval; Natural language processing; Urdu;
D O I
10.1007/s12652-023-04584-y
中图分类号
学科分类号
摘要
Urdu is morphologically rich language and lacks the resources available in English. While several studies on the image captioning task in English have been published, this is among the pioneer studies on Urdu generative image captioning. The study makes several key contributions: (i) it presents a new dataset for Urdu image captioning, and (ii) it presents different attention-based architectures for image captioning in the Urdu language. These attention mechanisms are new to the Urdu language, as those have never been used for the Urdu image captioning task (iii) Finally, it performs quantitative and qualitative analysis of the results by studying the impact of different model architectures on Urdu’s image caption generation task. The extensive experiments on the Urdu image caption generation task show encouraging results such as a BLEU-1 score of 72.5, BLEU-2 of 56.9, BLEU-3 of 42.8, and BLEU-4 of 31.6. Finally, we present data and code used in the study for future research via GitHub (https://github.com/saeedhas/Urdu_cap_gen). © 2023, The Author(s).
引用
收藏
页码:7719 / 7731
页数:12
相关论文
共 50 条
  • [1] Image Captioning Using Deep Learning
    Adithya, Paluvayi Veera
    Kalidindi, Mourya Viswanadh
    Swaroop, Nallani Jyothi
    Vishwas, H. N.
    [J]. ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT III, 2024, 2092 : 42 - 58
  • [2] Image Captioning using Deep Learning
    Jain, Yukti Sanjay
    Dhopeshwar, Tanisha
    Chadha, Supreet Kaur
    Pagire, Vrushali
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021,
  • [3] Image and Video Captioning for Apparels Using Deep Learning
    Agarwal, Govind
    Jindal, Kritika
    Chowdhury, Abishi
    Singh, Vishal K.
    Pal, Amrit
    [J]. IEEE ACCESS, 2024, 12 : 113138 - 113150
  • [4] Deep Learning for Military Image Captioning
    Das, Subrata
    Jain, Lalit
    Das, Amp
    [J]. 2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 2165 - 2171
  • [5] Image Captioning using Deep Learning: A Systematic Literature Review
    Chohan, Murk
    Khan, Adil
    Mahar, Muhammad Saleem
    Hassan, Saif
    Ghafoor, Abdul
    Khan, Mehmood
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 278 - 286
  • [6] Automatic image captioning system using a deep learning approach
    Deepak, Gerard
    Gali, Sowmya
    Sonker, Abhilash
    Jos, Bobin Cherian
    Sagar, K. V. Daya
    Singh, Charanjeet
    [J]. SOFT COMPUTING, 2023,
  • [7] Image Classification using Generative NeuroEvolution for Deep Learning
    Verbancsics, Phillip
    Harguess, Josh
    [J]. 2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 488 - 493
  • [8] Advanced Generative Deep Learning Techniques for Accurate Captioning of Images
    Chandar, J. Navin
    Kavitha, G.
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2024,
  • [9] A reference-based model using deep learning for image captioning
    Tiago do Carmo Nogueira
    Cássio Dener Noronha Vinhal
    Gélson da Cruz Júnior
    Matheus Rudolfo Diedrich Ullmann
    Thyago Carvalho Marques
    [J]. Multimedia Systems, 2023, 29 : 1665 - 1681
  • [10] A Comprehensive Survey of Deep Learning for Image Captioning
    Hossain, Md Zakir
    Sohel, Ferdous
    Shiratuddin, Mohd Fairuz
    Laga, Hamid
    [J]. ACM COMPUTING SURVEYS, 2019, 51 (06)