Multimodal Graph Transformer for Multimodal Question Answering

被引:0
|
作者
He, Xuehai [1 ]
Wang, Xin Eric [1 ]
机构
[1] UC Santa Cruz, United States
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
17th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2023
中图分类号
学科分类号
摘要
Computational linguistics - Natural language processing systems - Semantics
引用
下载
收藏
页码:189 / 200
相关论文
共 50 条
  • [41] Hierarchical Conditional Relation Networks for Multimodal Video Question Answering
    Thao Minh Le
    Vuong Le
    Svetha Venkatesh
    Truyen Tran
    International Journal of Computer Vision, 2021, 129 : 3027 - 3050
  • [42] FTN-VQA: MULTIMODAL REASONING BY LEVERAGING A FULLY TRANSFORMER-BASED NETWORK FOR VISUAL QUESTION ANSWERING
    Wang, Runmin
    Xu, Weixiang
    Zhu, Yanbin
    Zhu, Zhenlin
    Chen, Hua
    Ding, Yajun
    Liu, Jinping
    Gao, Changxin
    Sang, Nong
    FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2023, 31 (06)
  • [43] Syntax-Informed Question Answering with Heterogeneous Graph Transformer
    Zhu, Fangyi
    Tan, Lok You
    Ng, See-Kiong
    Bressan, Stephane
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2022, PT I, 2022, 13426 : 17 - 31
  • [44] Knowledge Graph Enhanced Transformer for Generative Question Answering Tasks
    Liang, Chaojie
    Yang, Jingying
    Fu, Xianghua
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 267 - 280
  • [45] Contrastive Video Question Answering via Video Graph Transformer
    Xiao, Junbin
    Zhou, Pan
    Yao, Angela
    Li, Yicong
    Hong, Richang
    Yan, Shuicheng
    Chua, Tat-Seng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13265 - 13280
  • [46] Multimodal Transformer for Multimodal Machine Translation
    Yao, Shaowei
    Wan, Xiaojun
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 4346 - 4350
  • [47] Visual Experience-Based Question Answering with Complex Multimodal Environments
    Kim, Incheol
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020 (2020)
  • [48] Video Question Answering Scheme Base on Multimodal Knowledge Active Learning
    Liu M.
    Wang R.
    Zhou F.
    Lin G.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (04): : 889 - 902
  • [49] Multimodal Encoder-Decoder Attention Networks for Visual Question Answering
    Chen, Chongqing
    Han, Dezhi
    Wang, Jun
    IEEE ACCESS, 2020, 8 : 35662 - 35671
  • [50] Deep multimodal relational reasoning and guided attention for chart question answering
    Srivastava, Swati
    Sharma, Himanshu
    Journal of Electronic Imaging, 2024, 33 (06)