共 50 条
- [1] Correlation Information Bottleneck: Towards Adapting Pretrained Multimodal Models for Robust Visual Question Answering International Journal of Computer Vision, 2024, 132 : 185 - 207
- [2] RSAdapter: Adapting Multimodal Models for Remote Sensing Visual Question Answering IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
- [3] VISUAL QUESTION ANSWERING IN REMOTE SENSING WITH CROSS-ATTENTION AND MULTIMODAL INFORMATION BOTTLENECK IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6278 - 6281
- [4] Adapting Visual Question Answering Models for Enhancing Multimodal Community Q&A Platforms PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1421 - 1430
- [5] Multimodal Attention for Visual Question Answering INTELLIGENT COMPUTING, VOL 1, 2019, 858 : 783 - 792
- [6] Robust Explanations for Visual Question Answering 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1566 - 1575
- [7] Multimodal Learning and Reasoning for Visual Question Answering ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [8] Faithful Multimodal Explanation for Visual Question Answering BLACKBOXNLP WORKSHOP ON ANALYZING AND INTERPRETING NEURAL NETWORKS FOR NLP AT ACL 2019, 2019, : 103 - 112
- [9] Visual Commonsense in Pretrained Unimodal and Multimodal Models NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5321 - 5335