Minimum Hypothesis Phone Error as a Decoding Method for Speech Recognition

被引:0
|
作者
Xu, Haihua [1 ]
Povey, Daniel [2 ]
Zhu, Jie [1 ]
Wu, Guanyong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Microsoft Res, Redmond, WA 14865 USA
关键词
Minimum Bayes Risk (MBR); MPE; Confusion Networks; Speech Recognition; Lattice Rescoring;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we show how methods for approximating phone error as normally used for Minimum Phone Error (MPE) discriminative training, can be used instead as a decoding criterion for lattice rescoring. This is an alternative to Confusion Networks (CN) which are commonly used in speech recognition. The standard (Maximum A Posteriori) decoding approach is a Minimum Bayes Risk estimate with respect to the Sentence Error Rate (SER); however, we are typically more interested in the Word Error Rate (WER). Methods such as CN and our proposed Minimum Hypothesis Phone Error (MHPE) aim to get closer to minimizing the expected WER. Based on preliminary experiments we find that our approach gives more improvement than CN, and is conceptually simpler.
引用
收藏
页码:92 / +
页数:2
相关论文
共 50 条
  • [1] MINIMUM PHONE ERROR BASED STREAM WEIGHT TRAINING FOR MANDARIN AUDIO-VISUAL SPEECH RECOGNITION
    Wu, Guanyong
    Zhu, Jie
    Xu, Haihua
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 902 - 905
  • [2] EVALUATION AND ANALYSIS OF MINIMUM PHONE ERROR TRAINING AND ITS MODIFIED VERSIONS FOR LARGE VOCABULARY MANDARIN SPEECH RECOGNITION
    Cheng, Yung-Jen
    Lin, Che-Kuang
    Lee, Lin-Shan
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 157 - 160
  • [3] Minimum classification error rate methods for speech recognition
    Juang, BH
    Chou, W
    Lee, CH
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (03): : 257 - 265
  • [4] Nonlinear Regularization Decoding Method for Speech Recognition
    Zhang, Jiang
    Wang, Liejun
    Yu, Yinfeng
    Xu, Miaomiao
    SENSORS, 2024, 24 (12)
  • [5] Segmental minimum Bayes-Risk decoding for automatic speech recognition
    Goel, V
    Kumar, S
    Byrne, W
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (03): : 234 - 249
  • [6] Smoothing Method for Improved Minimum Phone Error Linear Regression
    Qi, Yaohui
    Pan, Fuping
    Ge, Fengpei
    Zhao, Qingwei
    Yan, Yonghong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (08): : 2105 - 2113
  • [7] Minimum Bayes error feature selection for continuous speech recognition
    Saon, G
    Padmanabhan, M
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 800 - 806
  • [8] PROTOTYPE-BASED MINIMUM ERROR TRAINING FOR SPEECH RECOGNITION
    MCDERMOTT, E
    KATAGIRI, S
    APPLIED INTELLIGENCE, 1994, 4 (03) : 245 - 256
  • [9] Subspace method for minimum error pattern recognition
    Watanabe, H
    Katagiri, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1997, E80D (12) : 1195 - 1204
  • [10] A minimum classification error method for face recognition
    Chen, LH
    Chen, JR
    Liang, D
    Deng, SH
    Liao, HY
    SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND ITS APPLICATIONS, 1999, (465): : 630 - 633