共 50 条
- [32] Towards Adversarial Attack on Vision-Language Pre-training Models PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5005 - 5013
- [33] MAFA: Managing False Negatives for Vision-Language Pre-training 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 27304 - 27314
- [35] Multimodal Pre-training Method for Vision-language Understanding and Generation Ruan Jian Xue Bao/Journal of Software, 2023, 34 (05): : 2024 - 2034
- [36] Unified Vision-Language Pre-Training for Image Captioning and VQA THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13041 - 13049
- [37] Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 19068 - 19079
- [38] EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition 2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
- [40] On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 23783 - 23793