50 items in total
- [21] Deep Multimodal Reinforcement Network with Contextually Guided Recurrent Attention for Image Question Answering [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 32 (04): 738 - 748
- [22] Deep Multimodal Reinforcement Network with Contextually Guided Recurrent Attention for Image Question Answering [J]. Journal of Computer Science and Technology, 2017, 32 : 738 - 748
- [23] Hierarchical Attention Networks for Fact-based Visual Question Answering [J]. Multimedia Tools and Applications, 2024, 83 : 17281 - 17298
- [24] Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering [J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 451 - 466
- [27] Deep Modular Co-Attention Networks for Visual Question Answering [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 6274 - 6283
- [30] Question-Led object attention for visual question answering [J]. NEUROCOMPUTING, 2020, 391 : 227 - 233