50 entries in total
- [21] Towards the explainability of Multimodal Speech Emotion Recognition. INTERSPEECH 2021, 2021: 1748-1752
- [22] Temporal Multimodal Learning in Audiovisual Speech Recognition. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016: 3574-3582
- [23] CONTINUOUS VISUAL SPEECH RECOGNITION FOR MULTIMODAL FUSION. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014
- [25] END-TO-END MULTIMODAL SPEECH RECOGNITION. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018: 5774-5778
- [26] Improved Lip Contour Extraction For Visual Speech Recognition. 2015 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2015: 459-462
- [28] Lip location normalized training for visual speech recognition. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2000, E83D(11): 1969-1977
- [29] Visual Lip Contour Detection for the Purpose of Speech Recognition. 2014 INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS (ICSES), 2014
- [30] Speech recognition in adverse environments using lip information. IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997: 149-152