50 items in total
- [21] TVLT: Textless Vision-Language Transformer ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
- [22] The Neglected Tails in Vision-Language Models 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024 : 12988 - 12997
- [23] Vision-language navigation: a survey and taxonomy Neural Computing and Applications, 2024, 36 : 3291 - 3316
- [24] Vision-Language Models as Success Detectors CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 120 - 136
- [25] Vision-Language Fusion for Object Recognition THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017 : 4603 - 4610
- [26] 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation COMPUTER VISION - ECCV 2024, PT XL, 2025, 15098 : 21 - 38
- [27] Accelerating Vision-Language Pretraining with Free Language Modeling 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023 : 23161 - 23170
- [28] Towards Better Vision-Inspired Vision-Language Models 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024 : 13537 - 13547
- [29] Enhancing Automatic Placenta Analysis Through Distributional Feature Recomposition in Vision-Language Contrastive Learning MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 116 - 126
- [30] Language Features Matter: Effective Language Representations for Vision-Language Tasks 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019 : 7473 - 7482