共 50 条
- [2] DeAR: Debiasing Vision-Language Models with Additive Residuals 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6820 - 6829
- [3] Toward Building General Foundation Models for Language, Vision, and Vision-Language Understanding Tasks FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 551 - 568
- [5] Adventures of Trustworthy Vision-Language Models: A Survey THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22650 - 22658
- [6] Causal Attention for Vision-Language Tasks 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9842 - 9852
- [7] Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language Tasks 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19175 - 19186
- [8] Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 513 - 529
- [9] Vision-language navigation: a survey and taxonomy NEURAL COMPUTING & APPLICATIONS, 2024, 36 (07): : 3291 - 3316
- [10] Vision-language navigation: a survey and taxonomy Neural Computing and Applications, 2024, 36 : 3291 - 3316