50 items in total
- [1] Audio-visual speech translation with automatic lip synchronization and face tracking based on 3-D head model [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 2117 - 2120
- [2] On the Audio-visual Synchronization for Lip-to-Speech Synthesis [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 7809 - 7818
- [3] Realtime lip contour tracking for audio-visual speech recognition applications [J]. World Academy of Science, Engineering and Technology, 2009, 40 : 164 - 167
- [4] Lip Tracking Method for the System of Audio-Visual Polish Speech Recognition [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT I, 2012, 7267 : 535 - 542
- [5] THE USE OF DYNAMIC DEFORMABLE TEMPLATES FOR LIP TRACKING IN AN AUDIO-VISUAL CORPUS WITH LARGE VARIATIONS IN HEAD POSE, FACE ILLUMINATION AND LIP SHAPES [J]. 2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 370 - 373
- [8] Multimodal Learning Using 3D Audio-Visual Data for Audio-Visual Speech Recognition [J]. 2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 40 - 43
- [10] Audio-visual speech recognition integrating 3D lip information obtained from the Kinect [J]. Multimedia Systems, 2016, 22 : 315 - 323