共 50 条
- [2] HiVLP: Hierarchical Interactive Video-Language Pre-Training 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13710 - 13720
- [3] Object-aware Video-language Pre-training for Retrieval 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3303 - 3312
- [4] All in One: Exploring Unified Video-Language Pre-training 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6598 - 6608
- [5] HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15359 - 15370
- [7] VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4227 - 4239
- [8] EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5262 - 5274
- [9] Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [10] Masked Latent Semantic Modeling: an Efficient Pre-training Alternative to Masked Language Modeling FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 13949 - 13962