Minimum Hypothesis Phone Error as a Decoding Method for Speech Recognition

Cited by: 0

Authors:

Xu, Haihua [1]

Povey, Daniel [2]

Zhu, Jie [1]

Wu, Guanyong [1]

Affiliations:

[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China

[2] Microsoft Res, Redmond, WA 14865 USA

Source:

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009

Keywords:

Minimum Bayes Risk (MBR); MPE; Confusion Networks; Speech Recognition; Lattice Rescoring;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper we show how methods for approximating phone error, as normally used for Minimum Phone Error (MPE) discriminative training, can be used instead as a decoding criterion for lattice rescoring. This is an alternative to Confusion Networks (CN), which are commonly used in speech recognition. The standard (Maximum A Posteriori) decoding approach is a Minimum Bayes Risk estimate with respect to the Sentence Error Rate (SER); however, we are typically more interested in the Word Error Rate (WER). Methods such as CN and our proposed Minimum Hypothesis Phone Error (MHPE) aim to get closer to minimizing the expected WER. Based on preliminary experiments, we find that our approach gives more improvement than CN and is conceptually simpler.
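
The distinction the abstract draws between MAP decoding (minimum expected sentence error) and decoding that targets expected word error can be illustrated with a minimal sketch. This is not the paper's MHPE procedure, which rescores lattices with an MPE-style approximation of phone error. As simplifying assumptions, the sketch uses a small N-best list instead of a lattice and exact word-level Levenshtein distance as the loss. The names `NBEST`, `edit_distance`, `map_decode`, and `mbr_decode`, the example hypotheses, and their posteriors are all hypothetical.

```python
from typing import List, Sequence, Tuple

Hypothesis = Tuple[str, ...]

# Hypothetical N-best list: (word sequence, posterior probability).
NBEST: List[Tuple[Hypothesis, float]] = [
    (("wreck", "a", "nice", "beach"), 0.40),
    (("recognize", "speech"), 0.35),
    (("recognize", "peach"), 0.25),
]


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Word-level Levenshtein distance (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            cur.append(min(prev[j] + 1,               # deletion
                           cur[j - 1] + 1,            # insertion
                           prev[j - 1] + (r != h)))   # match or substitution
        prev = cur
    return prev[-1]


def map_decode(nbest: List[Tuple[Hypothesis, float]]) -> Hypothesis:
    """MAP decoding: pick the single most probable sentence.

    This minimizes the expected 0/1 sentence-level loss.
    """
    return max(nbest, key=lambda item: item[1])[0]


def mbr_decode(nbest: List[Tuple[Hypothesis, float]]) -> Hypothesis:
    """MBR decoding over the N-best list with word edit distance as the loss."""
    total = sum(p for _, p in nbest)  # renormalize posteriors over the list

    def expected_loss(candidate: Hypothesis) -> float:
        # Treat every hypothesis in turn as the possible truth,
        # weighted by its posterior.
        return sum((p / total) * edit_distance(ref, candidate)
                   for ref, p in nbest)

    return min((hyp for hyp, _ in nbest), key=expected_loss)


if __name__ == "__main__":
    print("MAP:", " ".join(map_decode(NBEST)))
    print("MBR:", " ".join(mbr_decode(NBEST)))
```

In this toy list the most probable sentence shares no words with the other two hypotheses, while those two differ from each other by a single word. MAP decoding therefore selects the first hypothesis, whereas the expected-word-error criterion selects the second, because it lies close to most of the remaining probability mass.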

Citation

Pages: 92 / +

Page count: 2

