50 entries in total
- [41] Recognition of Isolated Digit Using Random Forest for Audio-Visual Speech Recognition [J]. Proceedings of the National Academy of Sciences, India Section A: Physical Sciences, 2022, 92 : 103 - 110
- [43] Using Twin-HMM-Based Audio-Visual Speech Enhancement as a Front-End for Robust Audio-Visual Speech Recognition [J]. 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), Vols 1-5, 2013 : 867 - 871
- [44] Lip Tracking Method for the System of Audio-Visual Polish Speech Recognition [J]. Artificial Intelligence and Soft Computing, Pt I, 2012, 7267 : 535 - 542
- [45] Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition [J]. ICMR'16: Proceedings of the 2016 ACM International Conference on Multimedia Retrieval, 2016 : 281 - 284
- [47] Audio-visual speech recognition using minimum classification error training [J]. Neural Networks for Signal Processing X, Vols 1 and 2, Proceedings, 2000 : 3 - 12
- [48] Audio-Visual Action Recognition Using Transformer Fusion Network [J]. Applied Sciences-Basel, 2024, 14 (03)
- [49] Speaker independent audio-visual continuous speech recognition [J]. IEEE International Conference on Multimedia and Expo, Vol I and II, Proceedings, 2002 : A25 - A28
- [50] Building a data corpus for audio-visual speech recognition [J]. EUROMEDIA '2007, 2007 : 88 - 92