共 50 条
- [1] Generative Negative Text Replay for Continual Vision-Language Pretraining COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 22 - 38
- [2] Accelerating Vision-Language Pretraining with Free Language Modeling 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23161 - 23170
- [3] Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language Tasks 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19175 - 19186
- [4] Vision-Language Pretraining for Variable-Shot Image Classification MULTIMEDIA MODELING, MMM 2025, PT IV, 2025, 15523 : 283 - 297
- [5] PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [8] Single-Stream Multi-level Alignment for Vision-Language Pretraining COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 735 - 751
- [9] SELF-SUPERVISED VISION-LANGUAGE PRETRAINING FOR MEDIAL VISUAL QUESTION ANSWERING 2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,