50 items in total
- [1] VadCLIP: Adapting Vision-Language Models for Weakly Supervised Video Anomaly Detection. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024: 6074-6082
- [2] Weakly Supervised Grounding for VQA in Vision-Language Transformers. COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695: 652-670
- [4] 3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance. COMPUTER VISION - ECCV 2024, PT LXXIII, 2025, 15131: 87-104
- [6] SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance. COMPUTER VISION - ECCV 2024, PT XXXIX, 2025, 15097: 257-275
- [7] CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection. PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024: 1006-1015
- [8] Text Promptable Surgical Instrument Segmentation with Vision-Language Models. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
- [10] Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting. COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688: 284-302