Intelligent visual question answering in TCM education: An innovative application of IoT and multimodal fusion

被引:0
|
作者
Bi, Wei [1 ]
Xiong, Qingzhen [1 ]
Chen, Xingyi [1 ]
Du, Qingkun [2 ]
Wu, Jun [3 ]
Zhuang, Zhaoyu [4 ]
机构
[1] College of Art and Design, Guangdong University of Finance & Economics, Guangzhou,510320, China
[2] College of Arts and Education, Guangdong Jiangmen Preschool Teachers College, Jiangmen,529000, China
[3] School of Art and Design, Division of Arts, Shenzhen University, Shenzhen,518061, China
[4] Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen,518055, China
关键词
D O I
10.1016/j.aej.2024.12.052
中图分类号
学科分类号
摘要
Medicinal chemistry
引用
下载
收藏
页码:325 / 336
相关论文
共 50 条
  • [31] Multimodal Graph Networks for Compositional Generalization in Visual Question Answering
    Saqur, Raeid
    Narasimhan, Karthik
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [32] Fusion of Detected Objects in Text for Visual Question Answering
    Alberti, Chris
    Ling, Jeffrey
    Collins, Michael
    Reitter, David
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2131 - 2140
  • [33] Dual-Key Multimodal Backdoors for Visual Question Answering
    Walmer, Matthew
    Sikka, Karan
    Sur, Indranil
    Shrivastava, Abhinav
    Jha, Susmit
    arXiv, 2021,
  • [34] Visual Question Answering based on multimodal triplet knowledge accumuation
    Wang, Fengjuan
    An, Gaoyun
    2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 81 - 84
  • [35] DSAF: A Dual-Stage Attention Based Multimodal Fusion Framework for Medical Visual Question Answering
    K. Mukesh
    S. L. Jayaprakash
    R. Prasanna Kumar
    SN Computer Science, 6 (4)
  • [36] CONTEXT RELATION FUSION MODEL FOR VISUAL QUESTION ANSWERING
    Zhang, Haotian
    Wu, Wei
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2112 - 2116
  • [37] Advanced Visual and Textual Co-context Aware Attention Network with Dependent Multimodal Fusion Block for Visual Question Answering
    Hesam Shokri Asri
    Reza Safabakhsh
    Multimedia Tools and Applications, 2024, 83 (40) : 87959 - 87986
  • [38] Visual Experience-Based Question Answering with Complex Multimodal Environments
    Kim, Incheol
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020 (2020)
  • [39] Multimodal Encoder-Decoder Attention Networks for Visual Question Answering
    Chen, Chongqing
    Han, Dezhi
    Wang, Jun
    IEEE ACCESS, 2020, 8 : 35662 - 35671
  • [40] Bidirectional cascaded multimodal attention for multiple choice visual question answering
    Sushmita Upadhyay
    Sanjaya Shankar Tripathy
    Machine Vision and Applications, 2025, 36 (2)