共 50 条
- [31] RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 547 - 556
- [32] Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual Question Answering PROCEEDINGS OF THE 2023 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2023, 2023, : 169 - 176
- [33] K-armed Bandit based Multi-modal Network Architecture Search for Visual Question Answering MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1245 - 1254
- [34] TASK-ORIENTED MULTI-MODAL QUESTION ANSWERING FOR COLLABORATIVE APPLICATIONS 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1426 - 1430
- [35] MMTF: Multi-Modal Temporal Fusion for Commonsense Video Question Answering 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4659 - 4664
- [36] Multi-modal Question Answering System Driven by Domain Knowledge Graph 5TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM 2019), 2019, : 43 - 47
- [37] Multi-Modal Knowledge-Aware Attention Network for Question Answering Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (05): : 1037 - 1045
- [39] Hierarchical Multi-Task Learning for Diagram Question Answering with Multi-Modal Transformer PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1313 - 1321
- [40] Dual Path Multi-Modal High-Order Features for Textual Content based Visual Question Answering 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4324 - 4331