共 50 条
- [41] RSAdapter: Adapting Multimodal Models for Remote Sensing Visual Question Answering IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
- [42] Multimodal Cross-guided Attention Networks for Visual Question Answering PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTER MODELING, SIMULATION AND ALGORITHM (CMSA 2018), 2018, 151 : 347 - 353
- [43] Modular dual-stream visual fusion network for visual question answering VISUAL COMPUTER, 2024, : 549 - 562
- [44] BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8102 - 8109
- [45] Research and Application of Knowledge Graph Technology for Intelligent Question Answering PAAP 2021: 2021 12TH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING, 2021, : 152 - 156
- [46] Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1554 - 1563
- [47] Visual question answering model based on the fusion of multimodal features by a two-wav co-attention mechanism IMAGING SCIENCE JOURNAL, 2021, 69 (1-4): : 177 - 189
- [48] Application of a Neural Network-based Visual Question Answering System in Preschool Language Education IEIE Transactions on Smart Processing and Computing, 2023, 12 (05): : 419 - 427