共 50 条
- [1] Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 5731 - 5746
- [2] COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2188 - 2197
- [3] CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3966 - 3977
- [4] Cross-modal Semantic Alignment Pre-training for Vision-and-Language Navigation [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4233 - 4241
- [5] Vision Language Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14660 - 14679
- [6] Multi-Lingual Acquisition on Multimodal Pre-training for Cross-modal Retrieval [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [7] Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2843 - 2852
- [8] VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [10] Unicoder-VL: A Universal Encoder for Vision and Language by Cross-Modal Pre-Training [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11336 - 11344