共 50 条
- [22] Multi-level, multi-modal interactions for visual question answering over text in images [J]. World Wide Web, 2022, 25 : 1607 - 1623
- [23] Multi-level, multi-modal interactions for visual question answering over text in images [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2022, 25 (04): : 1607 - 1623
- [24] A Survey of Multi-modal Question Answering Systems for Robotics [J]. 2017 2ND INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM), 2017, : 189 - 194
- [26] Decouple Before Interact: Multi-Modal Prompt Learning for Continual Visual Question Answering [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2941 - 2950
- [29] Contrasting Dual Transformer Architectures for Multi-Modal Remote Sensing Image Retrieval [J]. APPLIED SCIENCES-BASEL, 2023, 13 (01):
- [30] VISUAL QUESTION ANSWERING FROM REMOTE SENSING IMAGES [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 4951 - 4954