50 entries in total
- [21] DeAR: Debiasing Vision-Language Models with Additive Residuals [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023: 6820-6829
- [23] VinVL: Revisiting Visual Representations in Vision-Language Models [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 5575-5584
- [24] Effectiveness assessment of recent large vision-language models [J]. Visual Intelligence, 2 (1):
- [25] Towards an Exhaustive Evaluation of Vision-Language Foundation Models [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023: 339-352
- [26] On Evaluating Adversarial Robustness of Large Vision-Language Models [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
- [28] ProVLA: Compositional Image Search with Progressive Vision-Language Alignment and Multimodal Fusion [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023: 2764-2769
- [29] ILLUME: Rationalizing Vision-Language Models through Human Interactions [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202