50 entries in total
- [32] Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation. Findings of the Association for Computational Linguistics (ACL 2022), 2022: 2383-2395
- [33] Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023: 2634-2645
- [34] Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language Tasks. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023: 19175-19186
- [36] Masked Vision-Language Transformer in Fashion. Machine Intelligence Research, 2023, 20: 421-434
- [38] Causal Attention for Vision-Language Tasks. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021), 2021: 9842-9852
- [39] Vision-Language Models for Biomedical Applications. Proceedings of the First International Workshop on Vision-Language Models for Biomedical Applications (VLM4Bio 2024), 2024: 1-2