ON NOISE ESTIMATION FOR ROBUST SPEECH RECOGNITION USING VECTOR TAYLOR SERIES

被引:8
|
作者
Zhao, Yong [1 ]
Juang, Biing-Hwang [1 ]
机构
[1] Georgia Inst Technol, Ctr Signal & Image Proc, Atlanta, GA 30332 USA
关键词
Robust speech recognition; vector Taylor series; noise estimation;
D O I
10.1109/ICASSP.2010.5495669
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a novel noise variance estimation method using the fixed point method for the VTS-based robust speech recognition. Noise parameters are re-estimated over a given utterance using an EM algorithm. The derivative of the auxiliary function with respect to the noise variance is resolved, and the fixed point algorithm estimates the noise variance by recursively approximating the root of the resulting derivative. The method leads to a re-estimation formula with a flavor like the standard ML variance estimation, and the iteration procedure is step-size free. We also investigate improving the noise estimation for efficient VTS adaptation. Several fast noise estimation methods are examined including estimation from non-speech areas and incremental adaptation. In the evaluation over Aurora 2 database, the proposed noise variance estimation method obtains a significant improvement in recognition accuracy over the method using sample variance. Further experiments show that the VTS ML estimation over non-speech areas is an effective fast adaptation method. The final refined approach achieves 8.75% WER, 13% relative improvement over the conventional VTS adaptation.
引用
收藏
页码:4290 / 4293
页数:4
相关论文
共 50 条
  • [11] Multi-environment model adaptation based on vector Taylor series for robust speech recognition
    Lue, Yong
    Wu, Haiyang
    Zhou, Lin
    Wu, Zhenyang
    [J]. PATTERN RECOGNITION, 2010, 43 (09) : 3093 - 3099
  • [12] AN ANALYSIS OF VECTOR TAYLOR SERIES MODEL COMPENSATION FOR NON-STATIONARY NOISE IN SPEECH RECOGNITION
    Duc Hoang Ha Nguyen
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    [J]. 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 131 - 135
  • [13] SPECTRAL ESTIMATION FOR NOISE ROBUST SPEECH RECOGNITION
    ERELL, A
    WEINTRAUB, M
    [J]. SPEECH AND NATURAL LANGUAGE, 1989, : 319 - 324
  • [14] Speech recognition in noisy environments using first-order vector Taylor series
    Kim, DY
    Un, CK
    Kim, NS
    [J]. SPEECH COMMUNICATION, 1998, 24 (01) : 39 - 49
  • [15] A vector Taylor series approach for environment-independent speech recognition
    Moreno, PJ
    Raj, B
    Stern, RM
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 733 - 736
  • [16] Robust speech recognition using adaptive noise threshold estimation and wavelet shrinkage
    Pham, Tuan Vam
    Kubin, Gernot
    Rank, Erhard
    [J]. 2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 2008, : 204 - +
  • [17] Noise Spectrum Estimation Using Line Spectral Frequencies for Robust Speech Recognition
    Jang, Gil-Jin
    Park, Jeong-Sik
    Kim, Sanghun
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2012, 31 (03): : 179 - 187
  • [18] Sequential noise estimation with optimal forgetting for robust speech recognition
    Afify, M
    Siohan, O
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 229 - 232
  • [20] Histogram equalization with Bayesian estimation for noise robust speech recognition
    Suh, Youngjoo
    Kim, Hoirin
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (02): : 677 - 685