共 50 条
- [2] Adventures of Trustworthy Vision-Language Models: A Survey [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22650 - 22658
- [3] Causal Attention for Vision-Language Tasks [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9842 - 9852
- [4] Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language Tasks [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19175 - 19186
- [5] Vision-language navigation: a survey and taxonomy [J]. Neural Computing and Applications, 2024, 36 : 3291 - 3316
- [6] Vision-language navigation: a survey and taxonomy [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (07): : 3291 - 3316
- [7] NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8312 - 8322
- [8] Learning to Prompt for Vision-Language Models [J]. International Journal of Computer Vision, 2022, 130 : 2337 - 2348
- [9] Learning to Prompt for Vision-Language Models [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (09) : 2337 - 2348
- [10] VISION-LANGUAGE MODELS AS SUCCESS DETECTORS [J]. CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 120 - 136