50 results in total
- [3] Cross-Modal Multistep Fusion Network With Co-Attention for Visual Question Answering [J]. IEEE ACCESS, 2018, 6 : 31516 - 31524
- [4] Cross-Modal Visual Question Answering for Remote Sensing Data [J]. 2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021 : 57 - 65
- [5] Cross-modal Relational Reasoning Network for Visual Question Answering [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021 : 3939 - 3948
- [6] Cross-Modal Self-Attention with Multi-Task Pre-Training for Medical Visual Question Answering [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021 : 456 - 460
- [7] Cross-Modal Retrieval for Knowledge-Based Visual Question Answering [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 421 - 438
- [10] Cross-Modal Dense Passage Retrieval for Outside Knowledge Visual Question Answering [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023 : 2829 - 2834