Lip temporal pattern analysis for automatic visual speech recognition

被引:0
|
作者
Xie, L [1 ]
Cai, XL [1 ]
Fu, ZH [1 ]
Jiang, DM [1 ]
Zhao, RC [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
关键词
visual speech recognition; lipreading; feature extraction; lip temporal pattern;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel approach to processing temporal lip motion information for dynamic visual feature extraction in visual speech recognition. The long-time Lip TenipoRA1 Patterns (LipTRAPs) of visual phonemes are introduced to analyze the nature of lip shape changes when uttering speech. A dynamic visual feature is also proposed based on the LipTRAPs. Visual speech recognition experiments on a connected-digits task show that the LipTRAP feature can yield significant WRR improvments than conventional delta features.
引用
收藏
页码:703 / 706
页数:4
相关论文
共 50 条
  • [21] Automatic speech recognition using audio visual cues
    Yashwanth, H
    Mahendrakar, H
    David, S
    PROCEEDINGS OF THE IEEE INDICON 2004, 2004, : 166 - 169
  • [22] Research on an automated speech pattern recognition system based on lip movement
    Zhang, B
    Fukui, Y
    PROCEEDINGS OF THE 18TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 18, PTS 1-5, 1997, 18 : 1530 - 1531
  • [23] Acoustic Analysis for Automatic Speech Recognition
    O'Shaughnessy, Douglas
    PROCEEDINGS OF THE IEEE, 2013, 101 (05) : 1038 - 1053
  • [24] Lip movement synthesis in audio-visual speech recognition system
    Li, Junquan
    Yin, Yixin
    Proc. 2005 IEEE Int. Conf. on Lang. Process. Knowl. Engin. IEEE NLP-KE '05, (461-465):
  • [25] Lip movement synthesis in audio-visual speech recognition system
    Li, JQ
    Yin, YX
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 461 - 465
  • [26] SPEECH RECOGNITION USING PATTERN ANALYSIS
    SOROKIN, VN
    ENGINEERING CYBERNETICS, 1966, (05): : 90 - &
  • [27] Audio-Visual Automatic Speech Recognition Using PZM, MFCC and Statistical Analysis
    Debnath, Saswati
    Roy, Pinki
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2021, 7 (02): : 121 - 133
  • [28] Visual-speech-pass filtering for robust automatic lip-reading
    Jong-Seok Lee
    Pattern Analysis and Applications, 2014, 17 : 611 - 621
  • [30] An audio-visual corpus for speech perception and automatic speech recognition (L)
    Cooke, Martin
    Barker, Jon
    Cunningham, Stuart
    Shao, Xu
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (05): : 2421 - 2424