共 50 条
- [31] SegEQA: Video Segmentation Based Visual Attention for Embodied Question Answering 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9666 - 9675
- [32] Language-aware Visual Semantic Distillation for Video Question Answering 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 27103 - 27113
- [33] A Causal Approach to Mitigate Modality Preference Bias in Medical Visual Question Answering PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON VISION-LANGUAGE MODELS FOR BIOMEDICAL APPLICATIONS, VLM4BIO 2024, 2024, : 13 - 17
- [38] Video Graph Transformer for Video Question Answering COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 39 - 58
- [40] Video Reference: A Video Question Answering Engine ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 799 - +