共 50 条
- [2] Spatio-Temporal Context Networks for Video Question Answering [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 108 - 118
- [3] Video -Context Aligned Transformer for Video Question Answering [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19795 - 19803
- [5] Video Question Answering with Spatio-Temporal Reasoning [J]. International Journal of Computer Vision, 2019, 127 : 1385 - 1412
- [6] Discovering Spatio-Temporal Rationales for Video Question Answering [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13823 - 13832
- [7] Spatio-Temporal Graph Convolution Transformer for Video Question Answering [J]. IEEE Access, 2024, 12 : 131664 - 131680
- [8] Dynamic Spatio-Temporal Modular Network for Video Question Answering [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4466 - 4477