Intelligent visual question answering in TCM education: An innovative application of IoT and multimodal fusion

Cited by: 0
Authors
Bi, Wei [1]
Xiong, Qingzhen [1]
Chen, Xingyi [1]
Du, Qingkun [2]
Wu, Jun [3]
Zhuang, Zhaoyu [4]
Affiliations
[1] College of Art and Design, Guangdong University of Finance & Economics, Guangzhou 510320, China
[2] College of Arts and Education, Guangdong Jiangmen Preschool Teachers College, Jiangmen 529000, China
[3] School of Art and Design, Division of Arts, Shenzhen University, Shenzhen 518061, China
[4] Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
DOI
10.1016/j.aej.2024.12.052
Abstract
Medicinal chemistry
Pages: 325 - 336
Related papers
50 in total
  • [21] MUREL: Multimodal Relational Reasoning for Visual Question Answering
    Cadene, Remi
    Ben-younes, Hedi
    Cord, Matthieu
    Thome, Nicolas
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019 : 1989 - 1998
  • [22] QAlayout: Question Answering Layout Based on Multimodal Attention for Visual Question Answering on Corporate Document
    Mahamoud, Ibrahim Souleiman
    Coustaty, Mickael
    Joseph, Aurelie
    d'Andecy, Vincent Poulain
    Ogier, Jean-Marc
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 659 - 673
  • [23] Application of Multimodal Transformer Model in Intelligent Agricultural Disease Detection and Question-Answering Systems
    Lu, Yuchun
    Lu, Xiaoyi
    Zheng, Liping
    Sun, Min
    Chen, Siyu
    Chen, Baiyan
    Wang, Tong
    Yang, Jiming
    Lv, Chunli
    PLANTS-BASEL, 2024, 13 (07)
  • [24] Multimodal Encoders and Decoders with Gate Attention for Visual Question Answering
    Li, Haiyan
    Han, Dezhi
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2021, 18 (03) : 1023 - 1040
  • [25] Multimodal Local Perception Bilinear Pooling for Visual Question Answering
    Lao, Mingrui
    Guo, Yanming
    Wang, Hui
    Zhang, Xin
    IEEE ACCESS, 2018, 6 : 57923 - 57932
  • [26] Dual-Key Multimodal Backdoors for Visual Question Answering
    Walmer, Matthew
    Sikka, Karan
    Sur, Indranil
    Shrivastava, Abhinav
    Jha, Susmit
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022 : 15354 - 15364
  • [27] Adaptive Attention Fusion Network for Visual Question Answering
    Gu, Geonmo
    Kim, Seong Tae
    Ro, Yong Man
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017 : 997 - 1002
  • [28] Relational reasoning and adaptive fusion for visual question answering
    Shen, Xiang
    Han, Dezhi
    Zong, Liang
    Guo, Zihan
    Hua, Jie
    APPLIED INTELLIGENCE, 2024, 54 (06) : 5062 - 5080
  • [29] Contrastive training of a multimodal encoder for medical visual question answering
    Silva, Joao Daniel
    Martins, Bruno
    Magalhaes, Joao
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 18
  • [30] Multimodal attention-driven visual question answering for Malayalam
    Kovath, A.G.
    Nayyar, A.
    Sikha, O.K.
    Neural Computing and Applications, 2024, 36 (24) : 14691 - 14708