共 50 条
- [41] Core Challenges in Embodied Vision-Language Planning [J]. Journal of Artificial Intelligence Research, 2022, 74 : 459 - 515
- [42] Vision-Language Models for Robot Success Detection [J]. THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23750 - 23752
- [43] Learning to Prompt for Vision-Language Emotion Recognition [J]. 2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2023,
- [45] HSVLT: Hierarchical Scale-Aware Vision-Language Transformer for Multi-Label Image Classification [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4768 - 4777
- [47] Structured Scene Memory for Vision-Language Navigation [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8451 - 8460
- [48] Survey on Vision-language Pre-training [J]. Ruan Jian Xue Bao/Journal of Software, 2023, 34 (05): : 2000 - 2023
- [49] Task Residual for Tuning Vision-Language Models [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10899 - 10909
- [50] Perceptual Grouping in Contrastive Vision-Language Models [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5548 - 5561