共 50 条
- [1] STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19215 - 19223
- [3] MIST : Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14773 - 14783
- [4] A video segmentation algorithm based on spatial-temporal information 2002 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS AND WEST SINO EXPOSITION PROCEEDINGS, VOLS 1-4, 2002, : 566 - 569
- [5] Question answering with imperfect temporal information FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2006, 4027 : 647 - 658
- [6] Uncovering the Temporal Context for Video Question Answering International Journal of Computer Vision, 2017, 124 : 409 - 421
- [8] Video foreground segmentation based on analysis of spatial-temporal information Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2011, 24 (04): : 582 - 590