50 items in total
- [1] Video Graph Transformer for Video Question Answering. COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696: 39-58
- [2] Video-Context Aligned Transformer for Video Question Answering. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024: 19795-19803
- [3] Embedding VLAD in Transformer for Video Question Answering. Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (04): 671-689
- [4] Multi-interaction Network with Object Relation for Video Question Answering. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019: 1193-1201
- [6] Redundancy-aware Transformer for Video Question Answering. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023: 3172-3180
- [7] Object-Centric Representation Learning for Video Question Answering. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021
- [8] Complementary spatiotemporal network for video question answering. Multimedia Systems, 2022, 28: 161-169
- [10] ATM: Action Temporality Modeling for Video Question Answering. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023: 4886-4895