50 entries in total
- [1] Multi-scale Relational Reasoning with Regional Attention for Visual Question Answering [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021: 5642 - 5649
- [3] Multi-modal co-attention relation networks for visual question answering [J]. VISUAL COMPUTER, 2023, 39 (11): 5783 - 5795
- [4] Differentiated Attention with Multi-modal Reasoning for Video Question Answering [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022: 525 - 530
- [6] Multi-modal adaptive gated mechanism for visual question answering [J]. PLOS ONE, 2023, 18 (06):
- [8] Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing [J]. IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXVIII, 2022, 12267