共 50 条
- [32] Deep Multi-modal Object Detection for Autonomous Driving [J]. 2021 18TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2021, : 7 - 11
- [33] Advancing Video Question Answering with a Multi-modal and Multi-layer Question Enhancement Network [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3985 - 3993
- [34] RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 547 - 556
- [35] Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual Question Answering [J]. PROCEEDINGS OF THE 2023 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2023, 2023, : 169 - 176
- [36] K-armed Bandit based Multi-modal Network Architecture Search for Visual Question Answering [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1245 - 1254
- [37] TASK-ORIENTED MULTI-MODAL QUESTION ANSWERING FOR COLLABORATIVE APPLICATIONS [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1426 - 1430
- [38] MMTF: Multi-Modal Temporal Fusion for Commonsense Video Question Answering [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4659 - 4664
- [39] Multi-modal Question Answering System Driven by Domain Knowledge Graph [J]. 5TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM 2019), 2019, : 43 - 47
- [40] Multi-Modal Knowledge-Aware Attention Network for Question Answering [J]. Xu, Changsheng (csxu@nlpr.ia.ac.cn), 1600, Science Press (57): : 1037 - 1045