50 items in total
- [1] Deep Modular Co-Attention Networks for Visual Question Answering [J]. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019), 2019 : 6274 - 6283
- [2] Deep Attention Neural Tensor Network for Visual Question Answering [J]. Computer Vision - ECCV 2018, Pt XII, 2018, 11216 : 21 - 37
- [5] Adaptive Attention Fusion Network for Visual Question Answering [J]. 2017 IEEE International Conference on Multimedia and Expo (ICME), 2017 : 997 - 1002
- [6] Co-Attention Network With Question Type for Visual Question Answering [J]. IEEE Access, 2019, 7 : 40771 - 40781
- [8] Modular dual-stream visual fusion network for visual question answering [J]. Visual Computer, 2024
- [10] Word-to-region attention network for visual question answering [J]. Multimedia Tools and Applications, 2019, 78 : 3843 - 3858