50 entries in total
- [3] Uni-NLX: Unifying Textual Explanations for Vision and Vision-Language Tasks 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4636 - 4641
- [4] RS5M and GeoRSCLIP: A Large-Scale Vision-Language Dataset and a Large Vision-Language Model for Remote Sensing IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
- [5] NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8312 - 8322
- [6] Vary: Scaling up the Vision Vocabulary for Large Vision-Language Model COMPUTER VISION-ECCV 2024, PT IV, 2025, 15062 : 408 - 424
- [7] Stable and low-precision training for large-scale vision-language models ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
- [9] e-CLIP: Large-Scale Vision-Language Representation Learning in E-commerce PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3484 - 3494