UNSUPERVISED EQUALIZATION OF LOMBARD EFFECT FOR SPEECH RECOGNITION IN NOISY ADVERSE ENVIRONMENT

被引:4
|
作者
Boril, Hynek [1 ]
Hansen, John H. L. [1 ]
机构
[1] Univ Texas Dallas, Ctr Robust Speech Syst, Erik Jonsson Sch Engn & Comp Sci, Richardson, TX 75083 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年
关键词
Lombard effect; speech recognition; frequency warping; cepstral compensation; codebook of noisy models;
D O I
10.1109/ICASSP.2009.4960489
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
When exposed to environmental noise, speakers adjust their speech production to maintain intelligible communication. This phenomenon, called Lombard effect (LE), is known to considerably impact the performance of automatic speech recognition (ASR) systems. In this study, novel frequency and cepstral domain equalizations that reduce the impact of LE on ASR are proposed. Short-time spectra of LE speech are transformed towards neutral ASR models in a maximum likelihood fashion. Dynamics of cepstral coefficients are normalized to a constant range using quantile estimations. The algorithms are incorporated in a recognizer employing a codebook of noisy acoustic models. In a recognition task on connected Czech digits presented in various levels of background car noise, the resulting system provides an absolute reduction in word error rate (WER) on 10 dB SNR data of 8.7% and 37.7% for female neutral and LE speech, and of 8.7% and 32.8% for male neutral and LE speech when compared to the baseline system employing perceptual linear prediction (PLP) coefficients and cepstral mean and variance normalization.
引用
收藏
页码:3937 / 3940
页数:4
相关论文
共 50 条
  • [1] Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments
    Boril, Hynek
    Hansen, John H. L.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1379 - 1393
  • [2] Reduced Complexity Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments
    Boril, Hynek
    Hansen, John H. L.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1235 - 1238
  • [3] Noisy Lombard and Loud speech compensation approach for speech recognition in extremely adverse environment
    Tian, Bin
    Yi, Kechu
    Shengxue Xuebao/Acta Acustica, 2003, 28 (01): : 28 - 32
  • [4] Lombard effect compensation and noise suppression for noisy Lombard speech recognition
    Chi, SM
    Oh, YH
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2013 - 2016
  • [5] NONLINEAR CEPSTRAL EQUALIZATION METHOD FOR NOISY SPEECH RECOGNITION
    LEE, LM
    CHEN, JK
    WANG, HC
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1994, 141 (06): : 397 - 402
  • [6] Lombard effect in Algerian Dialect Speech: Investigation of boosting and bypass strategies in narrowband noisy environment
    Ykhlef, Faycal
    Bouchaffra, Djamel
    APPLIED ACOUSTICS, 2022, 197
  • [7] Speech emotion recognition in noisy environment
    Chenchah, Farah
    Lachiri, Zied
    2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 788 - 792
  • [8] Study of speech recognition in noisy environment
    Kreisinger, T
    Pollak, P
    Sovka, P
    Uhlir, J
    SIGNAL ANALYSIS & PREDICTION I, 1997, : 334 - 337
  • [9] SPEECH RECOGNITION IN THE NOISY CAR ENVIRONMENT
    RUEHL, HW
    DOBLER, S
    WEITH, J
    MEYER, P
    NOLL, A
    HAMER, HH
    PIOTROWSKI, H
    SPEECH COMMUNICATION, 1991, 10 (01) : 11 - 22
  • [10] AUTOMATIC SPEECH RECOGNITION IN A NOISY AUTOMOTIVE ENVIRONMENT
    WILPON, JG
    RABINER, LR
    DEMARCO, D
    SHIPLEY, KL
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 : S94 - S94