50 items in total
- [1] Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 6778-6788
- [2] NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024: 7641-7649
- [3] Kiki or Bouba? Sound Symbolism in Vision-and-Language Models [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
- [4] Speaker-Follower Models for Vision-and-Language Navigation [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [5] Measuring Progress in Fine-grained Vision-and-Language Understanding [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023: 1559-1582
- [7] Iterative Vision-and-Language Navigation [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 14921-14930
- [8] WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models [J]. arXiv
- [9] WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
- [10] Tools Identification By On-Board Adaptation of Vision-and-Language Models [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024: 23799-23801