Feature pruning in likelihood evaluation of HMM-based speech recognition

被引:1
|
作者
Li, X [1 ]
Bilmes, J [1 ]
机构
[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
关键词
D O I
10.1109/ASRU.2003.1318458
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we present a simple yet effective technique to reduce the likelihood computation in ASR systems that use continuous density HMMs. In a variety of speech recognition tasks, likelihood evaluation accounts for a significant portion of the total computational load. Our proposed method, under certain conditions, only evaluates the component likelihoods of certain features; and approximates those of the remaining (pruned) features by prediction. We investigate two feature clustering approaches associated with our pruning technique. While a simple sequential clustering works remarkably well, a data-driven approach performs even better in its attempt to save computation while maintaining baseline recognition accuracy. With the second approach, we can speed up the likelihood evaluation by 33% and reduce its power consumption by 27% for an isolated word recognition task. For a continuous speech recognition system using either monophone or triphone models, the speedup and power reduction of the likelihood evaluation are 50% and 35% respectively.
引用
收藏
页码:303 / 308
页数:6
相关论文
共 50 条
  • [1] Efficient likelihood evaluation and dynamic Gaussian selection for HMM-based speech recognition
    Cai, Jun
    Bouselmi, Ghazi
    Laprie, Yves
    Haton, Jean-Paul
    [J]. COMPUTER SPEECH AND LANGUAGE, 2009, 23 (02): : 147 - 164
  • [2] Maximum likelihood linear transformations for HMM-based speech recognition
    Cambridge Univ Engineering Dep, Cambridge, United Kingdom
    [J]. Comput Speech Lang, 2 (75-98):
  • [3] Maximum likelihood linear transformations for HMM-based speech recognition
    Gales, MJF
    [J]. COMPUTER SPEECH AND LANGUAGE, 1998, 12 (02): : 75 - 98
  • [4] An HMM-based speech recognition IC
    Han, W
    Hon, KW
    Chan, CF
    Lee, T
    Choy, CS
    Pun, KP
    Ching, PC
    [J]. PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II: COMMUNICATIONS-MULTIMEDIA SYSTEMS & APPLICATIONS, 2003, : 744 - 747
  • [5] Lip Feature Extraction and Reduction for HMM-Based Visual Speech Recognition Systems
    Alizadeh, S.
    Boostani, R.
    Asadpour, V.
    [J]. ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 561 - +
  • [6] Peripheral features for HMM-based speech recognition
    Fukuda, T
    Takigawa, M
    Nitta, T
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 129 - 132
  • [7] Hybrid NN/HMM-based speech recognition with a discriminant neural feature extraction
    Willett, D
    Rigoll, G
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 763 - 769
  • [8] Independent vector analysis followed by HMM-based feature enhancement for robust speech recognition
    Cho, Ji-Won
    Park, Hyung-Min
    [J]. SIGNAL PROCESSING, 2016, 120 : 200 - 208
  • [9] Using SIMD technology to speed up likelihood computation in HMM-based speech recognition systems
    Ou, Jianlin
    Cai, Jun
    Lin, Qian
    [J]. 2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 123 - 127
  • [10] An Efficient HMM-Based Feature Enhancement Method With Filter Estimation for Reverberant Speech Recognition
    Cho, Ji-Won
    Park, Hyung-Min
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (12) : 1199 - 1202