共 50 条
- [41] Investigating Compositional Challenges in Vision-Language Models for Visual Grounding 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 14141 - 14151
- [42] In-Context Impersonation Reveals Large Language Models' Strengths and Biases ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [43] Unifying Visual and Vision-Language Tracking via Contrastive Learning THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4107 - 4116
- [44] Iterative Forward Tuning Boosts In-Context Learning in Language Models PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 15460 - 15473
- [47] GalLoP: Learning Global and Local Prompts for Vision-Language Models COMPUTER VISION - ECCV 2024, PT LXI, 2025, 15119 : 264 - 282
- [49] JoAPR: Cleaning the Lens of Prompt Learning for Vision-Language Models 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 28695 - 28705