共 50 条
- [1] CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [2] Unifying Vision-and-Language Tasks via Text Generation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [3] History Aware Multimodal Transformer for Vision-and-Language Navigation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [4] Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 9847 - 9857
- [6] Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 7606 - 7623
- [7] Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1207 - 1221
- [8] Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 380 - 397
- [10] HyperPELT: Unified Parameter-Efficient Language Model Tuning for Both Language and Vision-and-Language Tasks FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 11442 - 11453