50 entries in total
- [2] Cross-Modal Multistep Fusion Network With Co-Attention for Visual Question Answering [J]. IEEE ACCESS, 2018, 6 : 31516 - 31524
- [3] Cross-Modal Visual Question Answering for Remote Sensing Data [J]. 2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021 : 57 - 65
- [4] Cross-modal Relational Reasoning Network for Visual Question Answering [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021 : 3939 - 3948
- [6] Cross-Modal Retrieval for Knowledge-Based Visual Question Answering [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 421 - 438
- [9] Cross-Modal Dense Passage Retrieval for Outside Knowledge Visual Question Answering [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023 : 2829 - 2834
- [10] Jointly Learning Attentions with Semantic Cross-Modal Correlation for Visual Question Answering [J]. DATABASES THEORY AND APPLICATIONS, ADC 2017, 2017, 10538 : 248 - 260