LASSO ENVIRONMENT MODEL COMBINATION FOR ROBUST SPEECH RECOGNITION

被引:0
|
作者
Xiao, Xiong [1 ]
Li, Jinyu [2 ]
Chng, Eng Siong [1 ,2 ,3 ]
Li, Haizhou [1 ,3 ,4 ]
机构
[1] Nanyang Technol Univ, Temasek Lab, Singapore, Singapore
[2] Microsoft Corp, Redmond, WA 98052 USA
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
[4] Inst Infocomm Res, Dept Human Language Technol, Singapore, Singapore
关键词
noise robust speech recognition; model adaptation; L1; regularization; Lasso regression; model combination; SPEAKER ADAPTATION; REGRESSION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a novel acoustic model adaptation method for noise robust speech recognition. Model combination is a common way to adapt acoustic models to a target test environment. For example, the mean supervectors of the adapted model are obtained as a linear combination of mean supervectors of many pre-trained environment-dependent acoustic models. Usually, the combination weights are estimated using a maximum likelihood (ML) criterion and the weights are nonzero for all the mean supervectors. We propose to estimate the weights by using Lasso (least absolute shrinkage and selection operator) which imposes an L1 regularization term in the weight estimation problem to shrink some weights to exactly zero. Our study shows that Lasso usually shrinks to zero the weights of those mean supervectors not relevant to the test environment. By removing some nonrelevant supervectors, the obtained mean supervectors are found to be more robust against noise distortions. Experimental results on Aurora-2 task show that the Lasso-based mean combination consistently outperforms ML-based combination.
引用
收藏
页码:4305 / 4308
页数:4
相关论文
共 50 条
  • [1] Robust continuous speech recognition using parallel model combination
    Gales, MJF
    Young, SJ
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (05): : 352 - 359
  • [2] SPEECH ENHANCEMENT FOR ROBUST SPEECH RECOGNITION IN MOTORCYCLE ENVIRONMENT
    Mporas, Iosif
    Ganchev, Todor
    Kocsis, Otilia
    Fakotakis, Nikos
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2010, 19 (02) : 159 - 173
  • [3] Acoustic feature combination for robust speech recognition
    Zolnay, A
    Schlüter, R
    Ney, H
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 457 - 460
  • [4] Robust speech recognition for car environment noise
    Kokubo, H
    Amano, A
    Hataoka, N
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2002, 85 (11): : 65 - 73
  • [5] Robust phoneme recognition for a speech therapy environment
    Grossinho, Andre
    Guimaraes, Isabel
    Magalhaes, Joao
    Cavaco, Sofia
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON SERIOUS GAMES AND APPLICATIONS FOR HEALTH, 2016,
  • [6] An environment adaptation method for robust speech recognition
    Han, JQ
    Zhang, L
    Wang, CF
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 726 - 729
  • [7] Noise-robust speech recognition by discriminative adaptation in parallel model combination
    Chung, YJ
    [J]. ELECTRONICS LETTERS, 2000, 36 (04) : 370 - 371
  • [8] APPROXIMATED PARALLEL MODEL COMBINATION FOR EFFICIENT NOISE-ROBUST SPEECH RECOGNITION
    Sim, Khe Chai
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7383 - 7387
  • [9] Feature Compensation Employing Model Combination for Robust In-Vehicle Speech Recognition
    Kim, Wooil
    Hansen, John H. L.
    [J]. IN-VEHICLE CORPUS AND SIGNAL PROCESSING FOR DRIVER BEHAVIOR, 2009, : 233 - +
  • [10] ROBUST SPEECH RECOGNITION IN ADDITIVE AND CONVOLUTIONAL NOISE USING PARALLEL MODEL COMBINATION
    GALES, MJF
    YOUNG, SJ
    [J]. COMPUTER SPEECH AND LANGUAGE, 1995, 9 (04): : 289 - 307