共 50 条
- [1] Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 869 - 893
- [3] Perceptual Grouping in Contrastive Vision-Language Models 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5548 - 5561
- [4] SUGARCREPE: Fixing Hackable Benchmarks for Vision-Language Compositionality ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [5] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 15840 - 15853
- [6] Text Promptable Surgical Instrument Segmentation with Vision-Language Models ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [7] Learning the Visualness of Text Using Large Vision-Language Models 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 2394 - 2408
- [9] Contrastive Region Guidance: Improving Grounding in Vision-Language Models Without Training COMPUTER VISION - ECCV 2024, PT LXXIX, 2025, 15137 : 198 - 215