共 50 条
- [21] Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15624 - 15634
- [22] Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5749 - 5757
- [23] HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23507 - 23517
- [24] Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 2383 - 2395
- [25] Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2634 - 2645
- [26] MAMO: Fine-Grained Vision-Language Representations Learning with Masked Multimodal Modeling [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1528 - 1538
- [27] Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting [J]. COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 284 - 302
- [28] DeepUnseen: Unpredicted Event Recognition Through Integrated Vision-Language Models [J]. 2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 48 - 50
- [29] Image as a Language: Revisiting Scene Text Recognition via Balanced, Unified and Synchronized Vision-Language Reasoning Network [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5885 - 5893
- [30] Knowledge Boosting: Rethinking Medical Contrastive Vision-Language Pre-training [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT I, 2023, 14220 : 405 - 415