50 entries in total
- [1] Face Landmark-Based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6900 - 6904
- [2] Permutation Invariant Training of Deep Models for Speaker-Independent Multi-Talker Speech Separation [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 241 - 245
- [3] Permutation invariant training of deep models for speaker-independent multi-talker speech separation [J]. MECHANICAL ENGINEERING JOURNAL, 2023,
- [4] An Empirical Study of Visual Features for DNN Based Audio-Visual Speech Enhancement in Multi-Talker Environments [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 8418 - 8422
- [5] Audio-Visual Multi-Talker Speech Recognition in A Cocktail Party [J]. INTERSPEECH 2021, 2021, : 3021 - 3025
- [6] Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation [J]. ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04):
- [7] Multi-Stream Gated and Pyramidal Temporal Convolutional Neural Networks for Audio-Visual Speech Separation in Multi-Talker Environments [J]. INTERSPEECH 2021, 2021, : 1104 - 1108
- [8] An Attention Based Speaker-Independent Audio-Visual Deep Learning Model for Speech Enhancement [J]. MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 722 - 728
- [9] Acoustic scene complexity affects motion behavior during speech perception in audio-visual multi-talker virtual environments [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
- [10] Speaker independent audio-visual speech recognition [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1073 - 1076