50 entries in total
- [31] Single-modal Incremental Terrain Clustering from Self-Supervised Audio-Visual Feature Learning. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021: 9399-9406
- [34] Learning Contextually Fused Audio-Visual Representations for Audio-Visual Speech Recognition. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022: 1346-1350
- [35] Self-Supervised Visual Representations Learning by Contrastive Mask Prediction. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 10140-10149
- [36] Towards Efficient and Effective Self-supervised Learning of Visual Representations. COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691: 523-538
- [39] AV-PedAware: Self-Supervised Audio-Visual Fusion for Dynamic Pedestrian Awareness. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023: 1871-1877