Arabic Captioning for Images of Clothing Using Deep Learning

被引:2
|
作者
Al-Malki, Rasha Saleh [1 ]
Al-Aama, Arwa Yousuf [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Comp Sci Dept, Jeddah 21589, Saudi Arabia
关键词
deep learning; image captioning; transfer learning; image attributes;
D O I
10.3390/s23083783
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Fashion is one of the many fields of application that image captioning is being used in. For e-commerce websites holding tens of thousands of images of clothing, automated item descriptions are quite desirable. This paper addresses captioning images of clothing in the Arabic language using deep learning. Image captioning systems are based on Computer Vision and Natural Language Processing techniques because visual and textual understanding is needed for these systems. Many approaches have been proposed to build such systems. The most widely used methods are deep learning methods which use the image model to analyze the visual content of the image, and the language model to generate the caption. Generating the caption in the English language using deep learning algorithms received great attention from many researchers in their research, but there is still a gap in generating the caption in the Arabic language because public datasets are often not available in the Arabic language. In this work, we created an Arabic dataset for captioning images of clothing which we named "ArabicFashionData" because this model is the first model for captioning images of clothing in the Arabic language. Moreover, we classified the attributes of the images of clothing and used them as inputs to the decoder of our image captioning model to enhance Arabic caption quality. In addition, we used the attention mechanism. Our approach achieved a BLEU-1 score of 88.52. The experiment findings are encouraging and suggest that, with a bigger dataset, the attributes-based image captioning model can achieve excellent results for Arabic image captioning.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Vision to Language: Captioning Images using Deep Learning
    Charu, Shreyasi
    Mishra, S. P.
    Gandhi, Tapan
    [J]. 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2020,
  • [2] AraCap: A hybrid deep learning architecture for Arabic Image Captioning
    Afyouni, Imad
    Azhar, Imtinan
    Elnagar, Ashraf
    [J]. AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 382 - 389
  • [3] Image Captioning Using Deep Learning
    Adithya, Paluvayi Veera
    Kalidindi, Mourya Viswanadh
    Swaroop, Nallani Jyothi
    Vishwas, H. N.
    [J]. ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT III, 2024, 2092 : 42 - 58
  • [4] Image Captioning using Deep Learning
    Jain, Yukti Sanjay
    Dhopeshwar, Tanisha
    Chadha, Supreet Kaur
    Pagire, Vrushali
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021,
  • [5] Image and audio caps: automated captioning of background sounds and images using deep learning
    M. Poongodi
    Mounir Hamdi
    Huihui Wang
    [J]. Multimedia Systems, 2023, 29 : 2951 - 2959
  • [6] Image and audio caps: automated captioning of background sounds and images using deep learning
    Poongodi, M.
    Hamdi, Mounir
    Wang, Huihui
    [J]. MULTIMEDIA SYSTEMS, 2023, 29 (05) : 2951 - 2959
  • [7] Advanced Generative Deep Learning Techniques for Accurate Captioning of Images
    Chandar, J. Navin
    Kavitha, G.
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2024,
  • [8] Captioning the Images: A Deep Analysis
    Chaudhari, Chaitrali P.
    Devane, Satish
    [J]. COMPUTING, COMMUNICATION AND SIGNAL PROCESSING, ICCASP 2018, 2019, 810 : 987 - 999
  • [9] Historical Arabic Images Classification and Retrieval Using Siamese Deep Learning Model
    Khayyat, Manal M.
    Elrefaei, Lamiaa A.
    Khayyat, Mashael M.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (01): : 2109 - 2125
  • [10] Deep Learning for automatically describing images in natural language - Image Captioning
    Hotaran, Anca Mihaela
    Vrejoiu, Mihnea Horia
    [J]. ROMANIAN JOURNAL OF INFORMATION TECHNOLOGY AND AUTOMATIC CONTROL-REVISTA ROMANA DE INFORMATICA SI AUTOMATICA, 2020, 30 (01): : 87 - 100