共 50 条
- [4] Cross-Modal Visual Question Answering for Remote Sensing Data [J]. 2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 57 - 65
- [5] Cross-Modal Multistep Fusion Network With Co-Attention for Visual Question Answering [J]. IEEE ACCESS, 2018, 6 : 31516 - 31524
- [6] VCD: Visual Causality Discovery for Cross-Modal Question Reasoning [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VII, 2024, 14431 : 309 - 322
- [8] Cross-Modal Retrieval for Knowledge-Based Visual Question Answering [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 421 - 438
- [10] Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1097 - 1103