Language of Gleam: Impressionism Artwork Automatic Caption Generation for People with Visual Impairments

被引:2
|
作者
Lee, Dongmyeong [1 ]
Hwang, Hyegyeong [2 ]
Jabbar, Muhammad Shahid [2 ]
Cho, Jun-Dong [2 ]
机构
[1] Sungkyunkwan Univ, Human ICT Convergence, Suwon, South Korea
[2] Sungkyunkwan Univ, Coll Informat & Commun Engn, Suwon, South Korea
关键词
visual artwork; image caption generation; human-computer interaction; human-centric artificial intelligence; deep learning; PERCEPTION;
D O I
10.1117/12.2588331
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
User Experience Design (UX Design) comes from focusing on how products, in reality, affect the user's experience. In particular, the design of multi-modal interfaces for blind people facilitates the flexible and natural product or service capacity and improves blind people's interaction by overcoming the various existing constraints associated with any particular interaction. There have been various attempts to help visually impaired people appreciation of visual artwork, including multi-modal associations. However, these methods can only provide general information in terms of edge and pattern recognition by the sense of touch and restrained by the availability and number of specially developed artworks. We propose a novel method explaining visual artworks through image caption generation using artificial intelligence (AI) to improve artwork accessibility. This method can objectively describe any impressionism artwork used as a standalone description of art interpretation for blind people or can aide tactile-based methods. Based on end-to-end learning with a deep neural network, an encoder-decoder architecture model is adopted, and comprehensive experiments perform to confirm the stability of generated image captioning for stylized MS-COCO datasets with impressionism.
引用
收藏
页数:8
相关论文
共 13 条
  • [1] Tactile colour pictogram to improve artwork appreciation of people with visual impairments
    Cho, Jun Dong
    Quero, Luis Cavazos
    Bartolome, Jorge Iranzo
    Lee, Do Won
    Oh, Uran
    Lee, Inae
    [J]. COLOR RESEARCH AND APPLICATION, 2021, 46 (01): : 103 - 116
  • [2] Sound Coding Color to Improve Artwork Appreciation by People with Visual Impairments
    Cho, Jun Dong
    Jeong, Jaeho
    Kim, Ji Hye
    Lee, Hoonsuk
    [J]. ELECTRONICS, 2020, 9 (11) : 1 - 20
  • [3] GVA: guided visual attention approach for automatic image caption generation
    Hossen, Md. Bipul
    Ye, Zhongfu
    Abdussalam, Amr
    Hossain, Md. Imran
    [J]. MULTIMEDIA SYSTEMS, 2024, 30 (01)
  • [4] GVA: guided visual attention approach for automatic image caption generation
    Md. Bipul Hossen
    Zhongfu Ye
    Amr Abdussalam
    Md. Imran Hossain
    [J]. Multimedia Systems, 2024, 30
  • [5] AUTOMATIC GENERATION OF MATHEMATICAL GRAPH DESCRIPTIONS FOR STUDENTS WITH VISUAL IMPAIRMENTS
    Na, Heewon
    Yook, Juhye
    Dong, Suh-Yeon
    [J]. JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2021, 22 (09) : 1897 - 1912
  • [6] Automatic Generation of Image Caption Based on Semantic Relation using Deep Visual Attention Prediction
    El-gayar, M. M.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 105 - 114
  • [7] Automatic recognition of the American sign language fingerspelling alphabet to assist people living with speech or hearing impairments
    Luis Quesada
    Gustavo López
    Luis Guerrero
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2017, 8 : 625 - 635
  • [8] Automatic recognition of the American sign language fingerspelling alphabet to assist people living with speech or hearing impairments
    Quesada, Luis
    Lopez, Gustavo
    Guerrero, Luis
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2017, 8 (04) : 625 - 635
  • [9] Helping people with visual impairments gain access to graphical information through natural language:: The iGraph system
    Ferres, Leo
    Parush, Avi
    Roberts, Shelley
    Lindgaard, Gitte
    [J]. COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PROCEEDINGS, 2006, 4061 : 1122 - 1130
  • [10] Audio and Visual Exaggerated Expressive Speech Generation of English Language Learning Based on Automatic Context Algorithm
    Huang, Jie
    Gong, Xun
    [J]. IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 1774 - 1777