共 50 条
- [21] Video Question Answering by Frame Attention [J]. ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
- [22] BERT Representations for Video Question Answering [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1545 - 1554
- [23] Invariant Grounding for Video Question Answering [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2918 - 2927
- [24] Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3078 - 3089
- [25] Leveraging Video Descriptions to Learn Video Question Answering [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4334 - 4340
- [27] Document retrieval in the context of question answering [J]. ADVANCES IN INFORMATION RETRIEVAL, 2003, 2633 : 571 - 579
- [29] TempQuestions: A Benchmark for Temporal Question Answering [J]. COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1057 - 1062
- [30] Question answering with imperfect temporal information [J]. FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2006, 4027 : 647 - 658