共 50 条
- [41] Interactive Co-Learning with Cross-Modal Transformer for Audio-Visual Emotion Recognition INTERSPEECH 2022, 2022, : 4740 - 4744
- [42] Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1336 - 1345
- [44] Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9723 - 9732
- [45] IMPROVING AUDIO-VISUAL SPEECH RECOGNITION PERFORMANCE WITH CROSS-MODAL STUDENT-TEACHER TRAINING 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6560 - 6564
- [48] Audio-Visual Embedding for Cross-Modal Music Video Retrieval through Supervised Deep CCA 2018 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2018), 2018, : 143 - 150
- [49] Maintaining frame rate perception in interactive environments by exploiting audio-visual cross-modal interaction VISUAL COMPUTER, 2011, 27 (01): : 57 - 66