共 50 条
- [21] Knowledge Boosting: Rethinking Medical Contrastive Vision-Language Pre-training [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT I, 2023, 14220 : 405 - 415
- [22] Vision-Language Pre-Training: Basics, Recent Advances, and Future Trends [J]. FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2022, 14 (3-4): : 163 - 352
- [23] ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3135 - 3146
- [24] Kaleido-BERT: Vision-Language Pre-training on Fashion Domain [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12642 - 12652
- [25] Subsampling of Frequent Words in Text for Pre-training a Vision-Language Model [J]. PROCEEDINGS OF THE 1ST WORKSHOP ON LARGE GENERATIVE MODELS MEET MULTIMODAL APPLICATIONS, LGM3A 2023, 2023, : 61 - 67
- [26] EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [28] MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23262 - 23271
- [29] Automated Bridge Inspection Image Interpretation Based on Vision-Language Pre-Training [J]. COMPUTING IN CIVIL ENGINEERING 2023-DATA, SENSING, AND ANALYTICS, 2024, : 1 - 8
- [30] Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5120 - 5131