50 entries in total
- [1] AUDIO-VISUAL VOICE CONVERSION USING NOISE-ROBUST FEATURES [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014
- [2] Audio-visual voice conversion using deep canonical correlation analysis for deep bottleneck features [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 2469 - 2473
- [3] An Analysis of Performance Evaluation Metrics for Voice Conversion Models [J]. 2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022
- [5] A Robust Audio-visual Speech Recognition Using Audio-visual Voice Activity Detection [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010: 2702 - +
- [7] Combining audio and video metrics to assess audio-visual quality [J]. Multimedia Tools and Applications, 2018, 77: 23993 - 24012
- [8] A Comprehensive Evaluation of Audio-Visual Behavior in Various Modes of Interviews in the Wild [J]. 12TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS (PETRA 2019), 2019: 94 - 100
- [9] Analysis of lip geometric features for audio-visual speech recognition [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2004, 34 (04): 564 - 570
- [10] Dynamic visual features for audio-visual speaker verification [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (02): 136 - 149