50 items in total
- [1] FashionGPT: A Large Vision-Language Model for Enhancing Fashion Understanding. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT V, 2024, 15020: 308-323
- [2] Attention Prompting on Image for Large Vision-Language Models. COMPUTER VISION - ECCV 2024, PT XXX, 2025, 15088: 251-268
- [3] Graph neural networks in vision-language image understanding: a survey. VISUAL COMPUTER, 2025, 41 (01): 491-516
- [5] Vary: Scaling up the Vision Vocabulary for Large Vision-Language Model. COMPUTER VISION-ECCV 2024, PT IV, 2025, 15062: 408-424
- [8] Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language Tasks. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 19175-19186
- [10] Distilling Large Vision-Language Model with Out-of-Distribution Generalizability. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 2492-2503