50 entries in total
- [31] CMAL: A Novel Cross-Modal Associative Learning Framework for Vision-Language Pre-Training PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4515 - 4524
- [32] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12971 - 12980
- [34] MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23262 - 23271
- [35] Automated Bridge Inspection Image Interpretation Based on Vision-Language Pre-Training COMPUTING IN CIVIL ENGINEERING 2023-DATA, SENSING, AND ANALYTICS, 2024, : 1 - 8
- [37] MAKE: Vision-Language Pre-training based Product Retrieval in Taobao Search COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 356 - 360
- [38] Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5120 - 5131
- [39] Efficient Medical Images Text Detection with Vision-Language Pre-training Approach ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
- [40] VLMO: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022