共 50 条
- [3] Multimodal Graph Reasoning and Fusion for Video Question Answering [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1410 - 1415
- [4] MUTAN: Multimodal Tucker Fusion for Visual Question Answering [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2631 - 2639
- [6] Deep Multimodal Reinforcement Network with Contextually Guided Recurrent Attention for Image Question Answering [J]. Journal of Computer Science and Technology, 2017, 32 : 738 - 748
- [8] Improving Visual Question Answering by Multimodal Gate Fusion Network [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
- [10] Multimodal Graph Transformer for Multimodal Question Answering [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 189 - 200