50 entries in total
- [41] Hierarchical Conditional Relation Networks for Multimodal Video Question Answering. International Journal of Computer Vision, 2021, 129: 3027-3050
- [43] Syntax-Informed Question Answering with Heterogeneous Graph Transformer. Database and Expert Systems Applications, DEXA 2022, Pt I, 2022, 13426: 17-31
- [44] Knowledge Graph Enhanced Transformer for Generative Question Answering Tasks. Artificial Neural Networks and Machine Learning - ICANN 2021, Pt I, 2021, 12891: 267-280
- [46] Multimodal Transformer for Multimodal Machine Translation. 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), 2020: 4346-4350
- [48] Video Question Answering Scheme Base on Multimodal Knowledge Active Learning. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (04): 889-902
- [49] Multimodal Encoder-Decoder Attention Networks for Visual Question Answering. IEEE Access, 2020, 8: 35662-35671