共 50 条
- [22] VISUALVOICE: Audio-Visual Speech Separation with Cross-Modal Consistency 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15490 - 15500
- [23] Audio-visual fingerprinting and cross-modal aggregation: Components and applications 2008 IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS, VOLS 1 AND 2, 2008, : 243 - 246
- [25] Audio-visual Speaker Recognition with a Cross-modal Discriminative Network INTERSPEECH 2020, 2020, : 2242 - 2246
- [26] Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10543 - 10553
- [28] Interactive Co-Learning with Cross-Modal Transformer for Audio-Visual Emotion Recognition INTERSPEECH 2022, 2022, : 4740 - 4744
- [29] Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1336 - 1345