Assessing local noise level estimation methods: Application to noise robust ASR

Cited by: 51
Authors
Ris, C [1]
Dupont, S [1]
Affiliation
[1] Multitel, Fac Polytech Mons, TCTS, B-7000 Mons, Belgium
Keywords
robust automatic speech recognition; noise level estimation; noise reduction; spectral subtraction; missing data
DOI
10.1016/S0167-6393(00)00051-0
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we assess and compare four methods for the local estimation of noise spectra, namely energy clustering, Hirsch histograms, the weighted average method, and low-energy envelope tracking. Moreover, we introduce, for these four approaches, the harmonic filtering strategy, a new pre-processing technique expected to better track fast modulations of the noise energy. The speech periodicity property is used to update the noise level estimate during voiced parts of speech, without explicit detection of voiced portions. Our evaluation is performed with six different kinds of noise (both artificial and real) added to clean speech. The best noise level estimation method is then applied to noise robust speech recognition based on techniques requiring a dynamic estimation of the noise spectra, namely spectral subtraction and missing data compensation. © 2001 Elsevier Science B.V. All rights reserved.
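As a rough illustration of how a local noise level estimate can feed spectral subtraction, the following minimal Python sketch works on the energy sequence of a single frequency band. It is not the authors' algorithm: the function names (`low_energy_envelope`, `spectral_subtract`), the trailing-window scheme, and the parameters `window`, `fraction`, `alpha`, and `floor` are all assumptions chosen for this example, and the abstract does not specify how the paper's low-energy envelope tracking is actually defined.

```python
from typing import List


def low_energy_envelope(energies: List[float], window: int = 20,
                        fraction: float = 0.2) -> List[float]:
    """Hypothetical noise tracker: for each frame, average the lowest
    `fraction` of the energies seen in a trailing window of `window` frames."""
    if window < 1 or not 0.0 < fraction <= 1.0:
        raise ValueError("window must be >= 1 and fraction in (0, 1]")
    estimates = []
    for t in range(len(energies)):
        # Sort the most recent frames so the quietest ones come first.
        recent = sorted(energies[max(0, t - window + 1): t + 1])
        k = max(1, int(len(recent) * fraction))
        estimates.append(sum(recent[:k]) / k)
    return estimates


def spectral_subtract(energies: List[float], noise: List[float],
                      alpha: float = 1.0, floor: float = 0.01) -> List[float]:
    """Basic power spectral subtraction with a spectral floor
    (assumed parameters: over-subtraction factor `alpha`, floor ratio `floor`)."""
    if len(energies) != len(noise):
        raise ValueError("energies and noise must have the same length")
    # Subtract the scaled noise estimate, but never go below floor * energy.
    return [max(e - alpha * n, floor * e) for e, n in zip(energies, noise)]


# Toy band energies: low-level noise with a louder speech-like burst in the middle.
band = [1.0, 1.2, 0.9, 9.0, 12.0, 10.0, 1.1, 1.0]
noise = low_energy_envelope(band, window=4, fraction=0.5)
cleaned = spectral_subtract(band, noise)
```

With such a short window the estimate drifts upward during the burst, which hints at why tracking the noise level during speech is the hard part that the compared methods address.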
Pages: 141 - 158
Number of pages: 18
Related papers
50 in total
  • [21] Natural image noise level estimation based on local statistics for blind noise reduction
    Khmag, Asem
    Ramli, Abd Rahman
    Al-Haddad, S. A. R.
    Kamarudin, Noraziahtulhidayu
    [J]. VISUAL COMPUTER, 2018, 34 (04) : 575 - 587
  • [22] Some solutions to the missing feature problem in data classification, with application to noise robust ASR
    Morris, AC
    Cooke, MP
    Green, PD
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998 : 737 - 740
  • [23] Local estimation of the noise level in MRI using structural adaptation
    Tabelow, Karsten
    Voss, Henning U.
    Polzehl, Joerg
    [J]. MEDICAL IMAGE ANALYSIS, 2015, 20 (01) : 76 - 86
  • [24] Fast and reliable noise level estimation based on local statistic
    Jiang, Ping
    Zhang, Jian-zhou
    [J]. PATTERN RECOGNITION LETTERS, 2016, 78 : 8 - 13
  • [25] A noise-robust ASR front-end using Wiener filter constructed from MMSE estimation of clean speech and noise
    Wu, J
    Droppo, J
    Deng, L
    Acero, A
    [J]. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003 : 321 - 326
  • [26] Inhibition/enhancement network performance evaluation for noise robust ASR
    Huda, Mohammad Nurul
    Hasan, Mohammad Mahedi
    Hassan, Foyzul
    Kotwal, Mohammed Rokibul Alam
    Gazi Md, Moshfiqul Islam
    Hossain, Md. Shahadat
    Muhammad, Ghulam
    [J]. International Review on Computers and Software, 2010, 5 (05) : 548 - 556
  • [27] Temporal Modulation Processing of Speech Signals for Noise Robust ASR
    You, Hong
    Alwan, Abeer
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009 : 36 - 39
  • [28] EXTENSION OF UNCERTAINTY PROPAGATION TO DYNAMIC MFCCS FOR NOISE ROBUST ASR
    Tran, Dung T.
    Vincent, Emmanuel
    Jouvet, Denis
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014
  • [29] Improvements of a dual-input DBN for noise robust ASR
    Sun, Yang
    Gemmeke, Jort E.
    Cranen, Bert
    ten Bosch, Louis
    Boves, Lou
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011 : 1680 - 1683
  • [30] FUSION OF MULTIPLE UNCERTAINTY ESTIMATORS AND PROPAGATORS FOR NOISE ROBUST ASR
    Tran, Dung T.
    Vincent, Emmanuel
    Jouvet, Denis
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014