共 50 条
- [21] Asynchronous stream modeling for large vocabulary audio-visual speech recognition [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 169 - 172
- [22] Multi-stream asynchrony modeling for audio-visual speech recognition [J]. ISM 2007: NINTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2007, : 37 - 44
- [23] An audio-visual corpus for speech perception and automatic speech recognition (L) [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (05): : 2421 - 2424
- [24] End-to-end audio-visual speech recognition for overlapping speech [J]. INTERSPEECH 2021, 2021, : 3016 - 3020
- [25] Indonesian Audio-Visual Speech Corpus for Multimodal Automatic Speech Recognition [J]. 2017 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2017, : 381 - 385
- [26] A Robust Audio-visual Speech Recognition Using Audio-visual Voice Activity Detection [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2702 - +
- [27] Large vocabulary audio-visual speech recognition using the Janus speech recognition toolkit [J]. PATTERN RECOGNITION, 2004, 3175 : 488 - 495
- [28] Audio-visual speech experience with age influences perceived audio-visual asynchrony in speech [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (04): : 3001 - 3010
- [29] Audio-Visual Speech Recognition in the Presence of a Competing Speaker [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1292 - 1295