共 8 条
- [1] Fine-Grained Visual Prompt Learning of Vision-Language Models for Image Recognition [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5828 - 5836
- [2] VLUE: A Multi-Task Benchmark for Evaluating Vision-Language Models [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [3] MAMO: Fine-Grained Vision-Language Representations Learning with Masked Multimodal Modeling [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1528 - 1538
- [4] Open-set Fine-grained Retrieval via Prompting Vision-Language Evaluator [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19381 - 19391
- [5] Global-to-Contextual Shared Semantic Learning for Fine-Grained Vision-Language Alignment [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII, 2023, 14261 : 281 - 293
- [6] ViLLA: Fine-Grained Vision-Language Representation Learning from Real-World Data [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22168 - 22178
- [7] FashionSAP: Symbols and Attributes Prompt for Fine-grained Fashion Vision-Language Pre-training [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15028 - 15038