Variable Selection Linear Regression for Robust Speech Recognition

被引:1
|
作者
Tsao, Yu [1 ]
Hu, Ting-Yao [2 ]
Sakti, Sakriani [3 ]
Nakamura, Satoshi [3 ]
Lee, Lin-shan [2 ]
机构
[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 115, Taiwan
[2] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei 10764, Taiwan
[3] Nara Inst Sci & Technol, Grad Sch Informat Sci, Ikoma 6300192, Japan
关键词
variable selection; linear regression; MLLR; fMLLR; model space adaptation; feature space adaptation; RAPID SPEAKER ADAPTATION; NOISY ENVIRONMENTS; HMM ADAPTATION; TRANSFORMATION; EIGENSPACE; EIGENVOICE;
D O I
10.1587/transinf.E97.D.1477
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study proposes a variable selection linear regression (VSLR) adaptation framework to improve the accuracy of automatic speech recognition (ASR) with only limited and unlabeled adaptation data. The proposed framework can be divided into three phases. The first phase prepares multiple variable subsets by applying a ranking filter to the original regression variable set. The second phase determines the best variable subset based on a pre-determined performance evaluation criterion and computes a linear regression (LR) mapping function based on the determined subset. The third phase performs adaptation in either model or feature spaces. The three phases can select the optimal components and remove redundancies in the LR mapping function effectively and thus enable VSLR to provide satisfactory adaptation performance even with a very limited number of adaptation statistics. We formulate model space VSLR and feature space VSLR by integrating the VS techniques into the conventional LR adaptation systems. Experimental results on the Aurora-4 task show that model space VSLR and feature space VSLR, respectively, outperform standard maximum likelihood linear regression (MLLR) and feature space MLLR (fMLLR) and their extensions, with notable word error rate (WER) reductions in a per-utterance unsupervised adaptation manner.
引用
收藏
页码:1477 / 1487
页数:11
相关论文
共 50 条
  • [21] Noisy Constrained Maximum-Likelihood Linear Regression for Noise-Robust Speech Recognition
    Kim, D. K.
    Gales, M. J. F.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 315 - 325
  • [22] Maximum likelihood polynomial regression for robust speech recognition
    L Yong WU Zhenyang (School of Information Science and Engineering
    Chinese Journal of Acoustics, 2011, 30 (03) : 358 - 370
  • [23] Maximum likelihood polynomial regression for robust speech recognition
    Lü, Yong
    Wu, Zhenyang
    Shengxue Xuebao/Acta Acustica, 2010, 35 (01): : 88 - 96
  • [24] Adaptive Training with Noisy Constrained Maximum Likelihood Linear Regression for Noise Robust Speech Recognition
    Kim, D. K.
    Gales, M. J. F.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2367 - 2370
  • [25] Outlier robust model selection in linear regression
    Müller, S
    Welsh, AH
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (472) : 1297 - 1310
  • [26] Algorithms for robust model selection in linear regression
    Morgenthaler, S
    Welsch, RE
    Zenide, A
    THEORY AND APPLICATION OF RECENT ROBUST METHODS, 2004, : 195 - 206
  • [27] Quantile regression for robust estimation and variable selection in partially linear varying-coefficient models
    Yang, Jing
    Lu, Fang
    Yang, Hu
    STATISTICS, 2017, 51 (06) : 1179 - 1199
  • [28] Robust Variable Selection in Linear Mixed Models
    Fan, Yali
    Qin, Guoyou
    Zhu, Zhong Yi
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2014, 43 (21) : 4566 - 4581
  • [29] Robust variable selection for finite mixture regression models
    Qingguo Tang
    R. J. Karunamuni
    Annals of the Institute of Statistical Mathematics, 2018, 70 : 489 - 521
  • [30] Robust Variable Selection and Estimation in Threshold Regression Model
    Bo-wen Li
    Yun-qi Zhang
    Nian-sheng Tang
    Acta Mathematicae Applicatae Sinica, English Series, 2020, 36 : 332 - 346