Maximum Penalized Likelihood Kernel Regression for Fast Adaptation

被引:11
|
作者
Mak, Brian Kan-Wing [1 ]
Lai, Tsz-Chung [1 ]
Tsang, Ivor W. [2 ]
Kwok, James Tin-Yau [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
关键词
Kernel regression; maximum-likelihood linear regression (MLLR); reference speaker weighting; speaker adaptation; SPEAKER ADAPTATION; LINEAR-REGRESSION;
D O I
10.1109/TASL.2009.2019920
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a nonlinear generalization of the popular maximum-likelihood linear regression (MLLR) adaptation algorithm using kernel methods. The proposed method, called maximum penalized likelihood kernel regression adaptation (MPLKR), applies kernel regression with appropriate regularization to determine the affine model transform in a kernel-induced high-dimensional feature space. Although this is not the first attempt of applying kernel methods to conventional linear adaptation algorithms, unlike most of other kernelized adaptation methods such as kernel eigenvoice or kernel eigen-MLLR, MPLKR has the advantage that it is a convex optimization and its solution is always guaranteed to be globally optimal. In fact, the adapted Gaussian means can be obtained analytically by simply solving a system of linear equations. From the Bayesian perspective, MPLKR can also be considered as the kernel version of maximum a posteriori linear regression (MAPLR) adaptation. Supervised and unsupervised speaker adaptation using MPLKR were evaluated on the Resource Management and Wall Street Journal 5K tasks, respectively, achieving a word error rate reduction of 23.6% and 15.5% respectively over the speaker-independently model.
引用
收藏
页码:1372 / 1381
页数:10
相关论文
共 50 条
  • [1] Fast speaker adaption via maximum penalized likelihood kernel regression
    Tsang, Ivor W.
    Kwok, James T.
    Mak, Brian
    Zhang, Kai
    Pan, Jeffrey J.
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 997 - 1000
  • [2] Penalized maximum likelihood estimation of a stochastic multivariate regression model
    Hansen, Elizabeth
    Chan, Kung-Sik
    [J]. STATISTICS & PROBABILITY LETTERS, 2010, 80 (21-22) : 1643 - 1649
  • [3] Sparse factor regression via penalized maximum likelihood estimation
    Hirose, Kei
    Imada, Miyuki
    [J]. STATISTICAL PAPERS, 2018, 59 (02) : 633 - 662
  • [4] Sparse factor regression via penalized maximum likelihood estimation
    Kei Hirose
    Miyuki Imada
    [J]. Statistical Papers, 2018, 59 : 633 - 662
  • [5] Factored Maximum Penalized Likelihood Kernel Regression for HMM-Based Style-Adaptive Speech Synthesis
    Sung, June Sig
    Hong, Doo Hwa
    Kim, Nam Soo
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2014, 8 (02) : 251 - 261
  • [6] Maximum softly-penalized likelihood for mixed effects logistic regression
    Sterzinger, Philipp
    Kosmidis, Ioannis
    [J]. STATISTICS AND COMPUTING, 2023, 33 (02)
  • [7] Confidence sets based on penalized maximum likelihood estimators in Gaussian regression
    Poetscher, Benedikt M.
    Schneider, Ulrike
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2010, 4 : 334 - 360
  • [8] THE OPTIMALITY OF ITERATIVE MAXIMUM PENALIZED LIKELIHOOD ALGORITHMS FOR RIDGE-REGRESSION
    YU, SH
    [J]. APPLIED MATHEMATICS LETTERS, 1994, 7 (04) : 35 - 39
  • [9] Maximum softly-penalized likelihood for mixed effects logistic regression
    Philipp Sterzinger
    Ioannis Kosmidis
    [J]. Statistics and Computing, 2023, 33
  • [10] PENALIZED MAXIMUM-LIKELIHOOD ESTIMATION IN LOGISTIC-REGRESSION AND DISCRIMINATION
    ANDERSON, JA
    BLAIR, V
    [J]. BIOMETRIKA, 1982, 69 (01) : 123 - 136