Assessing local noise level estimation methods: Application to noise robust ASR

被引:51
|
作者
Ris, C [1 ]
Dupont, S [1 ]
机构
[1] Multitel, Fac Polytech Mons, TCTS, B-7000 Mons, Belgium
关键词
robust automatic speech recognition; noise level estimation; noise reduction; spectral subtraction; missing data;
D O I
10.1016/S0167-6393(00)00051-0
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we assess and compare four methods for the local estimation of noise spectra, namely the energy clustering, the Hirsch histograms, the weighted average method and the low-energy envelope tracking. Moreover we introduce, for these four approaches, the harmonic filtering strategy, a new pre-processing technique, expected to better track fast modulations of the noise energy. The speech periodicity property is used to update the noise level estimate during voiced parts of speech, without explicit detection of voiced portions. Our evaluation is performed with six different kinds of noises (both artificial and real noises) added to clean speech. The best noise level estimation method is then applied to noise robust speech recognition based on techniques requiring a dynamic estimation of the noise spectra, namely spectral subtraction and missing data compensation. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:141 / 158
页数:18
相关论文
共 50 条
  • [1] DISCRIMINATIVE UNCERTAINTY ESTIMATION FOR NOISE ROBUST ASR
    Tran, Dung T.
    Vincent, Emmanuel
    Jouvet, Denis
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5038 - 5042
  • [2] Noise robust ASR
    Viikki, O
    [J]. SPEECH COMMUNICATION, 2001, 34 (1-2) : 1 - 2
  • [3] Nonparametric Uncertainty Estimation and Propagation for Noise Robust ASR
    Tran, Dung T.
    Vincent, Emmanuel
    Jouvet, Denis
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1835 - 1846
  • [4] Artificial and online acquired noise dictionaries for noise robust ASR
    Gemmeke, Jort F.
    Virtanen, Tuomas
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2082 - +
  • [5] Methods for image noise level estimation
    Novikov, A., I
    Pronkin, A., V
    [J]. COMPUTER OPTICS, 2021, 45 (05) : 713 - +
  • [6] Support Vector Machines for Noise Robust ASR
    Gales, M. J. F.
    Ragni, A.
    AlDamarki, H.
    Gautier, C.
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 205 - 210
  • [7] On the use of wideband signal for noise robust ASR
    Macho, D
    Cheng, YM
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 109 - 112
  • [8] A probability weighted beamformer for noise robust ASR
    Bu, Suliang
    Zhao, Yunxin
    Hwang, Mei-Yuh
    Sun, Sining
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3048 - 3052
  • [9] Application of minimum statistics and minima controlled recursive averaging methods to estimate a cepstral noise model for robust asr
    Stouten, Veronique
    Van Hamme, Hugo
    Wambacq, Patrick
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 765 - 768
  • [10] A noise level estimation method of impulse noise image based on local similarity
    Lin, Cong
    Ye, Youqiang
    Feng, Siling
    Huang, Mengxing
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (11) : 15947 - 15960