共 50 条
- [33] Knowledge Boosting: Rethinking Medical Contrastive Vision-Language Pre-training [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT I, 2023, 14220 : 405 - 415
- [34] Vision-Language Pre-Training: Basics, Recent Advances, and Future Trends [J]. FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2022, 14 (3-4): : 163 - 352
- [35] Kaleido-BERT: Vision-Language Pre-training on Fashion Domain [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12642 - 12652
- [36] ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3135 - 3146
- [37] EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [38] BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [39] Weakly Supervised Grounding for VQA in Vision-Language Transformers [J]. COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 652 - 670