Maximum likelihood polynomial regression for robust speech recognition

Cited by: 0
Authors
L Yong WU Zhenyang (School of Information Science and Engineering
Institution
Funding
National Natural Science Foundation of China;
Keywords
MLLR; Maximum likelihood polynomial regression for robust speech recognition; MFCC;
DOI
10.15949/j.cnki.0217-9776.2011.03.004
Chinese Library Classification (CLC) number
TN912.34 [Speech recognition and equipment];
Discipline classification code
0711 ;
Abstract
The linearity assumption is the main limitation of maximum likelihood linear regression (MLLR). This paper applies polynomial regression to model adaptation and establishes a nonlinear model adaptation algorithm, maximum likelihood polynomial regression (MLPR), for robust speech recognition. In this algorithm, the nonlinear relationship between the training and testing Gaussian means in every Mel channel is approximated by a set of polynomials, and the polynomial coefficients are estimated from adaptation data in the test environment using the expectation-maximization (EM) algorithm under the maximum likelihood (ML) criterion. The experimental results show that a second-order polynomial approximates the actual nonlinear function more closely, and that in both noise compensation and speaker adaptation the word error rates of MLPR are significantly lower than those of MLLR. The proposed MLPR algorithm overcomes the limitation of the linearity assumption and can reduce the impact of noise, speaker and other factors simultaneously. It is especially suitable for joint adaptation to speaker and noise.
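The abstract gives no formulas, so the following is only a minimal sketch of how the coefficient estimation could look for a single channel. It is not the authors' implementation. It assumes diagonal covariances that are left unchanged by adaptation, occupancy statistics already accumulated in an E-step, and an adapted mean of the form a_0 + a_1*mu + a_2*mu^2 applied directly to the training mean mu. Under those assumptions, maximizing the EM auxiliary function reduces to a weighted least-squares problem. The names `solve_linear`, `mlpr_m_step` and `adapt_mean` are hypothetical.

```python
def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def mlpr_m_step(train_means, variances, gamma_sums, gamma_obs_sums, order=2):
    """Estimate polynomial coefficients a_0..a_order for one channel.

    Assumed inputs (one entry per Gaussian m, all for the same channel):
      train_means[m]    : mean of Gaussian m in the training environment
      variances[m]      : variance of Gaussian m (kept fixed)
      gamma_sums[m]     : sum over frames t of the posterior gamma_m(t)
      gamma_obs_sums[m] : sum over frames t of gamma_m(t) * o_t
    """
    K = order + 1
    A = [[0.0] * K for _ in range(K)]
    b = [0.0] * K
    for mu, var, g, go in zip(train_means, variances, gamma_sums, gamma_obs_sums):
        v = [mu ** k for k in range(K)]  # polynomial basis [1, mu, mu^2, ...]
        for i in range(K):
            b[i] += v[i] * go / var
            for j in range(K):
                A[i][j] += g * v[i] * v[j] / var
    # Normal equations obtained by setting the gradient of the
    # auxiliary function with respect to the coefficients to zero.
    return solve_linear(A, b)


def adapt_mean(mu, coeffs):
    """Map a training mean to its adapted value with the fitted polynomial."""
    return sum(a * mu ** k for k, a in enumerate(coeffs))


if __name__ == "__main__":
    # Toy statistics for four Gaussians in one channel (made-up numbers).
    means = [-1.0, 0.0, 1.0, 2.0]
    coeffs = mlpr_m_step(means, [1.0] * 4, [10.0] * 4, [-8.0, 5.0, 16.0, 25.0])
    print([adapt_mean(m, coeffs) for m in means])
```

In a full EM loop the posteriors would be recomputed with the adapted means after each such step. At least `order + 1` Gaussians with distinct training means are needed in the channel for the system to be solvable.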
Pages: 358 - 370
Number of pages: 13
Related papers
50 in total
  • [1] Maximum likelihood polynomial regression for robust speech recognition
    Lü, Yong
    Wu, Zhenyang
    [J]. Shengxue Xuebao/Acta Acustica, 2010, 35 (01) : 88 - 96
  • [2] Maximum likelihood subband polynomial regression for robust speech recognition
    Lu, Yong
    Wu, Zhenyang
    [J]. APPLIED ACOUSTICS, 2013, 74 (05) : 640 - 646
  • [3] Noisy Constrained Maximum-Likelihood Linear Regression for Noise-Robust Speech Recognition
    Kim, D. K.
    Gales, M. J. F.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02) : 315 - 325
  • [4] Adaptive Training with Noisy Constrained Maximum Likelihood Linear Regression for Noise Robust Speech Recognition
    Kim, D. K.
    Gales, M. J. F.
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2367 - 2370
  • [5] JOINT CONSTRAINED MAXIMUM LIKELIHOOD REGRESSION FOR OVERLAPPING SPEECH RECOGNITION
    Kumatani, Kenichi
    Singh, Rita
    Faubel, Friedrich
    McDonough, John
    Oualil, Youssef
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 121 - 125
  • [6] REGULARIZED CONSTRAINED MAXIMUM LIKELIHOOD LINEAR REGRESSION FOR SPEECH RECOGNITION
    Ghalehjegh, Sina Hamidi
    Rose, Richard C.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [7] A Variational Approach to Robust Maximum Likelihood Estimation for Speech Recognition
    Omar, Mohamed Kamal
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1049 - 1052
  • [8] MAXIMUM LIKELIHOOD ADAPTATION OF HISTOGRAM EQUALIZATION WITH CONSTRAINT FOR ROBUST SPEECH RECOGNITION
    Xiao, Xiong
    Li, Jinyu
    Chng, Eng Siong
    Li, Haizhou
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5480 - 5483
  • [9] Maximum likelihood sub-band adaptation for robust speech recognition
    Zhu, DL
    Nakamura, S
    Paliwal, KK
    Wang, RH
    [J]. SPEECH COMMUNICATION, 2005, 47 (03) : 243 - 264
  • [10] Maximum likelihood joint estimation of channel and noise for robust speech recognition
    Zhao, YX
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1109 - 1112