共 50 条
- [1] RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 547 - 556
- [2] Passage Retrieval for Outside-Knowledge Visual Question Answering [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1753 - 1757
- [3] Cross-Modal Dense Passage Retrieval for Outside Knowledge Visual Question Answering [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2829 - 2834
- [4] Knowledge-Enhanced Visual Question Answering with Multi-modal Joint Guidance [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE GRAPHS, IJCKG 2022, 2022, : 115 - 120
- [5] Multi-Modal Contrastive Pre-training for Recommendation [J]. PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 99 - 108
- [6] Cross-Modal Self-Attention with Multi-Task Pre-Training for Medical Visual Question Answering [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 456 - 460
- [9] Multi-modal adaptive gated mechanism for visual question answering [J]. PLOS ONE, 2023, 18 (06):
- [10] MULTI-MODAL PRE-TRAINING FOR AUTOMATED SPEECH RECOGNITION [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 246 - 250