50 entries in total
- [1] Audio-Visual Predictive Coding for Self-Supervised Visual Representation Learning [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9912 - 9919
- [3] SCLAV: Supervised Cross-modal Contrastive Learning for Audio-Visual Coding [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 261 - 270
- [4] Self-Supervised Audio-Visual Representation Learning for in-the-wild Videos [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5671 - 5672
- [5] Self-Supervised Learning by Cross-Modal Audio-Video Clustering [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [6] Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning [J]. 2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023
- [8] SELF-SUPERVISED LEARNING FOR AUDIO-VISUAL SPEAKER DIARIZATION [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4367 - 4371
- [9] Self-Supervised Visual Representations for Cross-Modal Retrieval [J]. ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 182 - 186
- [10] Self-Supervised Correlation Learning for Cross-Modal Retrieval [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2851 - 2863