50 entries in total
- [21] Enhancing Representation in Medical Vision-Language Foundation Models via Multi-Scale Information Extraction Techniques IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024
- [23] Enhancing Vision-Language Models Incorporating TSK Fuzzy System for Domain Adaptation 2024 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, FUZZ-IEEE 2024, 2024
- [24] ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 3135 - 3146
- [25] VLPSR: Enhancing Zero-Shot Object ReID with Vision-Language Model ADVANCES IN VISUAL COMPUTING, ISVC 2024, PT II, 2025, 15047 : 56 - 69
- [26] Temporal Modeling Approach for Video Action Recognition Based on Vision-language Models NEURAL INFORMATION PROCESSING, ICONIP 2023, PT III, 2024, 14449 : 512 - 523
- [27] Meta-Personalizing Vision-Language Models to Find Named Instances in Video 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 19123 - 19132
- [28] Improving Video Representation of Vision-Language Model with Decoupled Explicit Temporal Modeling PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII, 2025, 15037 : 525 - 539
- [29] VadCLIP: Adapting Vision-Language Models for Weakly Supervised Video Anomaly Detection THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024: 6074 - 6082
- [30] Vision-Language Recommendation via Attribute Augmented Multimodal Reinforcement Learning PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019: 39 - 47