共 50 条
- [3] Cross-modal Map Learning for Vision and Language Navigation [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15439 - 15449
- [4] SINC: Self-Supervised In-Context Learning for Vision-Language Tasks [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15384 - 15396
- [5] SELF-SUPERVISED LEARNING WITH CROSS-MODAL TRANSFORMERS FOR EMOTION RECOGNITION [J]. 2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 381 - 388
- [6] Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning [J]. 2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
- [7] Self-Supervised Learning by Cross-Modal Audio-Video Clustering [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [8] Self-Supervised Visual Representations for Cross-Modal Retrieval [J]. ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 182 - 186
- [9] Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15470 - 15479
- [10] Learning Mutual Modulation for Self-supervised Cross-Modal Super-Resolution [J]. COMPUTER VISION, ECCV 2022, PT XIX, 2022, 13679 : 1 - 18