35 items in total
- [22] Understanding Video Scenes through Text: Insights from Text-based Video Question Answering. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023: 4648-4652
- [23] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 9873-9883
- [25] AutoEval-Video: An Automatic Benchmark for Assessing Large Vision Language Models in Open-Ended Video Question Answering. COMPUTER VISION - ECCV 2024, PT XXXVII, 2025, 15095: 179-195
- [27] Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024: 18793-18803
- [29] Two-Stream Heterogeneous Graph Network with Dynamic Interactive Learning for Video Question Answering. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023
- [30] A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 7359-7368