共 50 条
- [31] Learning Visual Question Answering by Bootstrapping Hard Attention [J]. COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 : 3 - 20
- [33] Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [34] Co-Attention Network With Question Type for Visual Question Answering [J]. IEEE ACCESS, 2019, 7 : 40771 - 40781
- [35] QAlayout: Question Answering Layout Based on Multimodal Attention for Visual Question Answering on Corporate Document [J]. DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 659 - 673
- [36] GViG: Generative Visual Grounding Using Prompt-Based Language Modeling for Visual Question Answering [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT VI, PAKDD 2024, 2024, 14650 : 83 - 94
- [40] Word-to-region attention network for visual question answering [J]. Multimedia Tools and Applications, 2019, 78 : 3843 - 3858