共 50 条
- [1] Hierarchical Vision-Language Alignment for Video Captioning MULTIMEDIA MODELING (MMM 2019), PT I, 2019, 11295 : 42 - 54
- [2] Open-World Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 275 - 292
- [4] Vision-Language Knowledge Exploration for Video Saliency Prediction PATTERN RECOGNITION AND COMPUTER VISION, PT IX, PRCV 2024, 2025, 15039 : 191 - 205
- [5] MixPrompt: Enhancing Generalizability and Adversarial Robustness for Vision-Language Models via Prompt Fusion ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IX, ICIC 2024, 2024, 14870 : 328 - 339
- [6] FashionGPT: A Large Vision-Language Model for Enhancing Fashion Understanding ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT V, 2024, 15020 : 308 - 323
- [7] Enhancing Concept-Based Explanation with Vision-Language Models 2024 IEEE 37TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS 2024, 2024, : 219 - 224
- [8] Revisiting Classifier: Transferring Vision-Language Models for Video Recognition THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2847 - 2855
- [10] Zero-shot Object Detection Through Vision-Language Embedding Alignment 2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 926 - 940