50 entries in total
- [2] Hierarchical Object-oriented Spatio-Temporal Reasoning for Video Question Answering. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021: 636-642
- [3] Spatio-Temporal Context Networks for Video Question Answering. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736: 108-118
- [4] Discovering Spatio-Temporal Rationales for Video Question Answering. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023: 13823-13832
- [5] Spatio-Temporal Graph Convolution Transformer for Video Question Answering. IEEE Access, 2024, 12: 131664-131680
- [6] Dynamic Spatio-Temporal Modular Network for Video Question Answering. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022: 4466-4477
- [8] Video Question Answering via Hierarchical Spatio-Temporal Attention Networks. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017: 3518-3524
- [9] TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 1359-1367
- [10] Divide and Conquer: Question-Guided Spatio-Temporal Contextual Attention for Video Question Answering. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34: 11101-11108