50 entries in total
- [3] Pairwise VLAD Interaction Network for Video Question Answering. Proceedings of the 29th ACM International Conference on Multimedia, MM 2021, 2021: 5119-5127
- [5] Multi-Attention Relation Network for Figure Question Answering. Knowledge Science, Engineering and Management, Pt II, 2022, 13369: 667-680
- [6] Multi-Scale Progressive Attention Network for Video Question Answering. ACL-IJCNLP 2021: The 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Vol 2, 2021: 973-978
- [7] Multi-Scale Progressive Attention Network for Video Question Answering. ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference, 2021, 2: 873-878
- [8] Advancing Video Question Answering with a Multi-modal and Multi-layer Question Enhancement Network. Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, 2023: 3985-3993
- [9] Text-Guided Object Detector for Multi-modal Video Question Answering. 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023: 1032-1042