50 items in total
- [1] E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021: 503-513
- [2] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
- [3] Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023: 6789-6798
- [4] Vision-Language Pre-Training with Triple Contrastive Learning [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 15650-15659
- [5] Survey on Vision-language Pre-training [J]. Ruan Jian Xue Bao/Journal of Software, 2023, 34(05): 2000-2023
- [6] SPEECH-LANGUAGE PRE-TRAINING FOR END-TO-END SPOKEN LANGUAGE UNDERSTANDING [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 7458-7462
- [7] Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023: 1073-1083
- [8] Speech Model Pre-training for End-to-End Spoken Language Understanding [J]. INTERSPEECH 2019, 2019: 814-818
- [9] UNIMO-2: End-to-End Unified Vision-Language Grounded Learning [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022: 3187-3201
- [10] VLP: A Survey on Vision-language Pre-training [J]. MACHINE INTELLIGENCE RESEARCH, 2023, 20(01): 38-56