共 50 条
- [41] GridMM: Grid Memory Map for Vision-and-Language Navigation 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15579 - 15590
- [42] InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 485 - 492
- [43] KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2583 - 2592
- [44] Sub-Instruction Aware Vision-and-Language Navigation PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3360 - 3376
- [45] Action Inference for Destination Prediction in Vision-and-Language Navigation PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 4: STUDENT RESEARCH WORKSHOP, 2024, : 210 - 217
- [46] OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, 2023, 14233 : 245 - 256
- [47] NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7641 - 7649
- [48] Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [49] Federated Learning for Vision-and-Language Grounding Problems THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11572 - 11579
- [50] Are You Looking? Grounding to Multiple Modalities in Vision-and-Language Navigation 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 6551 - 6557