共 50 条
- [41] SCLAV: Supervised Cross-modal Contrastive Learning for Audio-Visual Coding [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 261 - 270
- [43] A Closer Look at Weakly-Supervised Audio-Visual Source Localization [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [44] Self-Supervised Visual Representations Learning by Contrastive Mask Prediction [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10140 - 10149
- [45] PointCMP: Contrastive Mask Prediction for Self-supervised Learning on Point Cloud Videos [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1212 - 1222
- [46] Audio-Visual Self-Supervised Terrain Type Recognition for Ground Mobile Platforms [J]. IEEE ACCESS, 2021, 9 : 29970 - 29979
- [47] Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1456 - 1463
- [48] Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3884 - 3892
- [49] Self-Supervised Audio-Visual Feature Learning for Single-Modal Incremental Terrain Type Clustering [J]. IEEE ACCESS, 2021, 9 : 64346 - 64357
- [50] Single-modal Incremental Terrain Clustering from Self-Supervised Audio-Visual Feature Learning [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9399 - 9406