Lip temporal pattern analysis for automatic visual speech recognition

被引:0
|
作者
Xie, L [1 ]
Cai, XL [1 ]
Fu, ZH [1 ]
Jiang, DM [1 ]
Zhao, RC [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
关键词
visual speech recognition; lipreading; feature extraction; lip temporal pattern;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel approach to processing temporal lip motion information for dynamic visual feature extraction in visual speech recognition. The long-time Lip TenipoRA1 Patterns (LipTRAPs) of visual phonemes are introduced to analyze the nature of lip shape changes when uttering speech. A dynamic visual feature is also proposed based on the LipTRAPs. Visual speech recognition experiments on a connected-digits task show that the LipTRAP feature can yield significant WRR improvments than conventional delta features.
引用
收藏
页码:703 / 706
页数:4
相关论文
共 50 条
  • [41] Adaptive fusion of acoustic and visual sources for automatic speech recognition
    Rogozan, A
    Deléglise, P
    SPEECH COMMUNICATION, 1998, 26 (1-2) : 149 - 161
  • [42] Automatic integrated analysis of jaw and lip movement in speech production
    Borghese, NA
    Ferrigno, G
    Redolfi, M
    Pedotti, A
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 101 (01): : 482 - 487
  • [43] Asynchronous integration of visual information in an automatic speech recognition system
    Alissali, M
    Deleglise, P
    Rogozan, A
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 34 - 37
  • [44] Temporal pattern discrimination and speech recognition under electrical simulation
    1600, American Inst of Physics, Woodbury, NY, USA (96):
  • [45] Realtime lip contour tracking for audio-visual speech recognition applications
    Yazdi, Mehran
    Seyfi, Mehdi
    Rafati, Amirhossein
    Asadi, Meghdad
    World Academy of Science, Engineering and Technology, 2009, 40 : 164 - 167
  • [46] A robust hierarchical lip tracking approach for lipreading and audio visual speech recognition
    Xie, L
    Cai, XL
    Fu, ZH
    Zhao, RC
    Jiang, DM
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 3620 - 3624
  • [47] Lip Tracking Using Particle Filter and Geometric Model for Visual Speech Recognition
    Jarraya, Islem
    Werda, Salah
    Mahdi, Walid
    2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS (SIGMAP), 2014, : 172 - 179
  • [48] Lip Tracking Method for the System of Audio-Visual Polish Speech Recognition
    Kubanek, Mariusz
    Bobulski, Janusz
    Adrjanowicz, Lukasz
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT I, 2012, 7267 : 535 - 542
  • [49] Automatic Speech Recognition using Correlation Analysis
    Pramanik, Arnab
    Raha, Rajorshee
    PROCEEDINGS OF THE 2012 WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES, 2012, : 670 - 674
  • [50] Spectral Analysis for Automatic Speech Recognition and Enhancement
    Oruh, Jane
    Viriri, Serestina
    MACHINE LEARNING FOR NETWORKING, MLN 2020, 2021, 12629 : 245 - 254