共 50 条
- [31] Multimodal Graph Networks for Compositional Generalization in Visual Question Answering ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [32] Fusion of Detected Objects in Text for Visual Question Answering 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2131 - 2140
- [34] Visual Question Answering based on multimodal triplet knowledge accumuation 2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 81 - 84
- [36] CONTEXT RELATION FUSION MODEL FOR VISUAL QUESTION ANSWERING 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2112 - 2116
- [39] Multimodal Encoder-Decoder Attention Networks for Visual Question Answering IEEE ACCESS, 2020, 8 : 35662 - 35671