共 50 条
- [21] MUREL: Multimodal Relational Reasoning for Visual Question Answering 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1989 - 1998
- [22] QAlayout: Question Answering Layout Based on Multimodal Attention for Visual Question Answering on Corporate Document DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 659 - 673
- [23] Application of Multimodal Transformer Model in Intelligent Agricultural Disease Detection and Question-Answering Systems PLANTS-BASEL, 2024, 13 (07):
- [25] Multimodal Local Perception Bilinear Pooling for Visual Question Answering IEEE ACCESS, 2018, 6 : 57923 - 57932
- [26] Dual-Key Multimodal Backdoors for Visual Question Answering 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15354 - 15364
- [27] ADAPTIVE ATTENTION FUSION NETWORK FOR VISUAL QUESTION ANSWERING 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 997 - 1002
- [29] Contrastive training of a multimodal encoder for medical visual question answering INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 18