Automated Image Captioning Using Sparrow Search Algorithm With Improved Deep Learning Model

被引:0
|
作者
Arasi, Munya A. [1 ]
Alshahrani, Haya Mesfer [2 ]
Alruwais, Nuha [3 ]
Motwakel, Abdelwahed [4 ]
Ahmed, Noura Abdelaziz [5 ]
Mohamed, Abdullah [6 ]
机构
[1] King Khalid Univ, Coll Sci & Arts Rijal Almaa, Dept Comp Sci, Abha 62529, Saudi Arabia
[2] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, POB 84428, Riyadh 11671, Saudi Arabia
[3] King Saud Univ, Coll Appl Studies & Community Serv, Dept Comp Sci & Engn, POB 22459, Riyadh 11495, Saudi Arabia
[4] Prince Sattam Bin Abdulaziz Univ, Coll Business Adm Hawtat Bani Tamim, Dept Management Informat Syst, Al Kharj 11942, Saudi Arabia
[5] Prince Sattam Bin Abdulaziz Univ, Dept Comp & Self Dev, Preparatory Year Deanship, Al Kharj 11942, Saudi Arabia
[6] Future Univ Egypt, Res Ctr, New Cairo 11845, Egypt
关键词
Convolutional neural networks; Visualization; Feature extraction; Convolution; Deep learning; Natural language processing; Computational modeling; Image capture; Search methods; Image captioning; deep learning; natural language processing; sparrow search algorithm; computer vision;
D O I
10.1109/ACCESS.2023.3317276
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image captioning is a deep learning technique that intends to create and generate textual descriptions or captions for images. It integrates computer vision and natural language processing (NLP) to comprehend the visual content of an image and generate human-like descriptions. Deep learning (DL) based image captioning models can be trained on large-scale datasets, allowing them to generalize various types of images and generate captions that apply to a wide range of visual scenarios. By combining computer vision and natural language processing, DL-enabled image captioning models can understand both visual and textual information, which enables them to generate captions that not only describe the visual content but also incorporate contextual and semantic information. This study develops an Automated Image Captioning using Sparrow Search Algorithm with Improved Deep Learning (AIC-SSAIDL) technique. The major intention of the AIC-SSAIDL technique lies in the automated generation of textual captions for the input images. To accomplish this, the AIC-SSAIDL technique utilizes the MobileNetv2 model to generate feature descriptors of the input images and its hyperparameter tuning process takes place using SSA. For the image captioning process, the AIC-SSAIDL technique utilizes an attention mechanism with long short-term memory (AM-LSTM) network. Finally, the hyperparameter selection of the AM-LSTM model is performed by the fruit fly optimization (FFO) algorithm. A wide range of experiments has been conducted on benchmark data to depict the better performance of the AIC-SSAIDL method. The comprehensive result analysis highlighted the enhanced captioning results of the AIC-SSAIDL method with maximum CIDEr of 46.12, 61.89, and 137.45 on Flickr8k, Flickr30k, and MSCOCO datasets, respectively.
引用
收藏
页码:104633 / 104642
页数:10
相关论文
共 50 条
  • [21] Image Captioning Using Multimodal Deep Learning Approach
    Farkh, Rihem
    Oudinet, Ghislain
    Foued, Yasser
    Computers, Materials and Continua, 2024, 81 (03): : 3951 - 3968
  • [22] Deep Learning Network Based on Improved Sparrow Search Algorithm Optimization for Rolling Bearing Fault Diagnosis
    Ma, Guoyuan
    Yue, Xiaofeng
    Zhu, Juan
    Liu, Zeyuan
    Lu, Shibo
    MATHEMATICS, 2023, 11 (22)
  • [23] Improved Spectral Clustering Clothing Image Segmentation Algorithm Based on Sparrow Search Algorithm
    黄文谙
    钱素琴
    Journal of Donghua University(English Edition), 2022, 39 (04) : 340 - 344
  • [24] Multi-threshold image segmentation based on improved sparrow search algorithm
    Lyu X.
    Mu X.
    Zhang J.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (02): : 318 - 327
  • [25] Lens Learning Sparrow Search Algorithm
    Ouyang, Chengtian
    Zhu, Donglin
    Qiu, Yaxian
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [26] Deep Learning for Military Image Captioning
    Das, Subrata
    Jain, Lalit
    Das, Amp
    2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 2165 - 2171
  • [27] RETRACTED: Medical Image Captioning Using Optimized Deep Learning Model (Retracted Article)
    Singh, Arjun
    Raguru, Jaya Krishna
    Prasad, Gaurav
    Chauhan, Surbhi
    Tiwari, Pradeep Kumar
    Zaguia, Atef
    Ullah, Mohammad Aman
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [28] A Network Security Transmission Model Based on Improved Sparrow Search Algorithm
    State Grid Kashi Electric Power Supply Company, Kashgar, China
    不详
    Int. J. Inf. Secur. Priv., 1
  • [29] Malicious URL Classification Model Based on Improved Sparrow Search Algorithm
    Ma, Yiran
    Guan, Qihang
    Guo, Fengyuan
    Zhang, Guidong
    PROCEEDINGS OF 2021 IEEE 11TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2021), 2021, : 21 - 25
  • [30] Sparrow Search Algorithm With Stacked Deep Learning Based Medical Image Analysis for Pancreatic Cancer Detection and Classification
    Ramesh, Janjhyam Venkata Naga
    Abirami, T.
    Gopalakrishnan, T.
    Narayanasamy, Kanagaraj
    Ishak, Mohamad Khairi
    Karim, Faten Khalid
    Mostafa, Samih M.
    Allakany, Alaa
    IEEE ACCESS, 2023, 11 : 111927 - 111935