50 items in total
- [2] Multi-level Attention Networks for Visual Question Answering [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 4187 - 4195
- [7] Multi-level, multi-modal interactions for visual question answering over text in images [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2022, 25 (04): 1607 - 1623
- [8] Mutual Attention Inception Network for Remote Sensing Visual Question Answering [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
- [9] Multi-level Contrastive Learning for Commonsense Question Answering [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2023, 2023, 14120 : 318 - 331
- [10] Multi-grained Attention with Object-level Grounding for Visual Question Answering [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019: 3595 - 3600