共 50 条
- [42] Contrastive training of a multimodal encoder for medical visual question answering INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 18
- [43] Visual Question Answering based on multimodal triplet knowledge accumuation 2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 81 - 84
- [44] Multimodal Dual Attention Memory for Video Story Question Answering COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 698 - 713
- [46] Improving Visual Question Answering by Multimodal Gate Fusion Network 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
- [47] Speech Grammars for Textual Entailment Patterns in Multimodal Question Answering LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 3554 - 3558
- [48] Hierarchical Conditional Relation Networks for Multimodal Video Question Answering International Journal of Computer Vision, 2021, 129 : 3027 - 3050