共 50 条
- [41] Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [42] Text-Guided Object Detector for Multi-modal Video Question Answering 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 1032 - 1042
- [43] Open-Ended Multi-Modal Relational Reasoning for Video Question Answering 2023 32ND IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, RO-MAN, 2023, : 363 - 369
- [45] Temporally Multi-Modal Semantic Reasoning with Spatial Language Constraints for Video Question Answering SYMMETRY-BASEL, 2022, 14 (06):
- [46] ESSAY-ANCHOR ATTENTIVE MULTI-MODAL BILINEAR POOLING FOR TEXTBOOK QUESTION ANSWERING 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
- [47] MRD-Net: Multi-Modal Residual Knowledge Distillation for Spoken Question Answering PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3985 - 3991
- [48] Gaining Extra Supervision via Multi-task learning for Multi-Modal Video Question Answering 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
- [49] Co-Attending Free-Form Regions and Detections with Multi-Modal Multiplicative Feature Embedding for Visual Question Answering THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7218 - 7225
- [50] Multi-Modal Correlated Network with Emotional Reasoning Knowledge for Social Intelligence Question-Answering 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3067 - 3073