50 entries in total
- [31] Improving speech embedding using crossmodal transfer learning with audio-visual data. Multimedia Tools and Applications, 2019, 78: 15681-15704
- [33] Boosting and structure learning in dynamic Bayesian networks for audio-visual speaker detection. 16th International Conference on Pattern Recognition, Vol III, Proceedings, 2002: 789-794
- [35] An audio-visual speech recognition with a new mandarin audio-visual database. Int Conf on Cybernetics and Information Technologies, Systems and Applications/Int Conf on Computing, Communications and Control Technologies, Vol 1, 2007: 19-+
- [36] Robust Audio-Visual Speech Synchrony Detection by Generalized Bimodal Linear Prediction. INTERSPEECH 2009: 10th Annual Conference of the International Speech Communication Association 2009, Vols 1-5, 2009: 2219-+
- [37] Human Audio-Visual Consonant Recognition Analyzed with Three Bimodal Integration Models. INTERSPEECH 2009: 10th Annual Conference of the International Speech Communication Association 2009, Vols 1-5, 2009: 820-823
- [38] Audio-Visual Speech Synchronization Detection Using a Bimodal Linear Prediction Model. 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPR Workshops 2009), Vols 1 and 2, 2009: 670-+
- [39] Noisy audio feature enhancement using audio-visual speech data. 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vols I-IV, Proceedings, 2002: 2025-2028