Recognition of visual speech elements using adaptively boosted hidden Markov models

被引:29
|
作者
Foo, SW [1 ]
Lian, Y
Dong, L
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 119260, Singapore
[3] Natl Univ Singapore, Dept Elect & Comp Engn, Digital Syst & Applicat Lab, Singapore 117548, Singapore
关键词
adaptive boosting (AdaBoost); automatic lip reading; hidden Markov model (HMM); visual speech processing;
D O I
10.1109/TCSVT.2004.826773
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The performance of automatic speech recognition (ASR) system can be significantly enhanced with additional information from visual speech elements such as the movement of lips, tongue, and teeth, especially under noisy environment. In this paper, a novel approach for recognition of visual speech elements is presented. The approach makes use of adaptive boosting (AdaBoost) and hidden Markov models (HMMs) to build an AdaBoost-HMM classifier. The composite HMMs of the AdaBoost-HMM classifier are trained to cover different groups of training samples using the AdaBoost technique and the biased Baum-Welch training method. By combining the decisions of the component classifiers of the composite HMMs according to a novel probability synthesis rule, a more complex decision boundary is formulated than using the single HMM classifier. The method is applied to the recognition of the basic visual speech elements. Experimental results show that the AdaBoost-HMM classifier outperforms the traditional HMM classifier in accuracy, especially for visemes extracted from contexts.
引用
收藏
页码:693 / 705
页数:13
相关论文
共 50 条
  • [1] Recognition of visual speech elements using Hidden Markov Models
    Foo, SW
    Dong, L
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002, PROCEEDING, 2002, 2532 : 607 - 614
  • [2] Visual speech recognition using Active Shape Models and Hidden Markov Models
    Luettin, J
    Thacker, NA
    Beet, SW
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 817 - 820
  • [3] Visual speech recognition using motion features and hidden Markov models
    Yau, Wai Chee
    Kumar, Dinesh Kant
    Weghorn, Hans
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2007, 4673 : 832 - 839
  • [4] Boosted Large-Margin Estimation of Hidden Markov Models for Speech Recognition
    Xu Shuangyin
    Dan, Qu
    [J]. PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 545 - 548
  • [5] Speech emotion recognition using hidden Markov models
    Nwe, TL
    Foo, SW
    De Silva, LC
    [J]. SPEECH COMMUNICATION, 2003, 41 (04) : 603 - 623
  • [6] HIDDEN MARKOV MODELS IN SPEECH RECOGNITION
    Krajcovic, J.
    Hrncar, M.
    Muzikarova, E.
    [J]. ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2008, 7 (1-2) : 250 - 252
  • [7] Continuous Malayalam Speech Recognition Using Hidden Markov Models
    Mohamed, Anuj
    Nair, K. N. Ramachandran
    [J]. PROCEEDINGS OF THE FIRST AMRITA ACM-W CELEBRATION OF WOMEN IN COMPUTING IN INDIA (A2WIC), 2010,
  • [8] Isolated Malay speech recognition using Hidden Markov Models
    Rosdi, Fadhilah
    Ainon, Raja N.
    [J]. 2008 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING, VOLS 1-3, 2008, : 721 - 725
  • [9] Speech Recognition of Moroccan Dialect Using Hidden Markov Models
    Mouaz, Bezoui
    Abderrahim, Beni Hssane
    Abdelmajid, Elmoutaouakkil
    [J]. 10TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2019) / THE 2ND INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40 2019) / AFFILIATED WORKSHOPS, 2019, 151 : 985 - 991
  • [10] Method of speech recognition and speaker identification using audio-visual of polish speech and hidden Markov models
    Kubanek, Mariusz
    [J]. BIOMETRICS, COMPUTER SECURITY SYSTEMS AND ARTIFICIAL INTELLIGENCE APPLICATIONS, 2006, : 45 - 55