Model-Based Wiener filter for noise robust speech recognition

被引:0
|
作者
Arakawa, Takayuki
Tsujikawa, Masanori
Isotani, Ryosuke
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a new approach for noise robust speech recognition, which integrates signal-processing-based spectral enhancement and statistical-model-based compensation. The proposed method, Model-Based Wiener filter (MBW), takes three steps to estimate clean speech signals from noisy speech signals, which are corrupted by various kinds of additive background noise. The first step is the well-known spectral subtraction (SS). Since the SS averagely subtracts noise components, the estimated speech signals often include distortion. In the second step, the distortion caused by SS is reduced using the minimum mean square error estimation for a Gaussian mixture model representing pre-trained knowledge of speech. In the final step, the Wiener filtering is performed with the decision-directed method. Experiments are conducted using the Aurora2-J (Japanese digit string) database. The results show that the proposed method performs as well as the ETSI advanced front-end in average and the variation range of the recognition accuracy according to the kind of noise is about one third, which demonstrates the robustness of the proposed method.
引用
收藏
页码:537 / 540
页数:4
相关论文
共 50 条
  • [21] Incomplete spectrogram reconstruction with kalman filter for noise robust speech recognition
    Mohammadi, Arash
    Almasganj, Farshad
    Sadrieh, Nima
    Zandi, Alireza
    [J]. 2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 814 - +
  • [22] Unsupervised modulation filter learning for noise-robust speech recognition
    Agrawal, Purvi
    Ganapathy, Sriram
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (03): : 1686 - 1692
  • [23] GMM-Based two-stage mel-warped Wiener filter for robust speech recognition
    Lei, Jianjun
    Guo, Jun
    Liu, Gang
    Wang, Jian
    Shen, Halfeng
    Nie, Xiangfei
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 827 - 830
  • [24] A SPARSITY BASED PREPROCESSING FOR NOISE ROBUST SPEECH RECOGNITION
    Koniaris, Christos
    Chatterjee, Saikat
    [J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 513 - 518
  • [25] NOISE IDENTIFICATION FOR MODEL-BASED SPEECH ENHANCEMENT
    Jiang Wenbin
    Ying Rendong
    Liu Peilin
    [J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 478 - 483
  • [26] Noise Reduction and Speech Enhancement Using Wiener Filter
    Nuha, Hilal H.
    Absa, Ahmad Abo
    [J]. 2022 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ITS APPLICATIONS (ICODSA), 2022, : 177 - 180
  • [27] MODEL-BASED DEREVERBERATION IN THE LOGMELSPEC DOMAIN FOR ROBUST DISTANT-TALKING SPEECH RECOGNITION
    Sehr, Armin
    Maas, Roland
    Kellermann, Walter
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4298 - 4301
  • [28] GEVD Based Speech and Noise Correlation Matrix Estimation for Multichannel Wiener Filter Based Noise Reduction
    Van Rompaey, Robbe
    Moonen, Marc
    [J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2544 - 2548
  • [29] Statistical Model-Based Noise Reduction Approach for Car Interior Applications to Speech Recognition
    Lee, Sung Joo
    Kang, Byung Ok
    Jung, Ho-Young
    Lee, Yunkeun
    Kim, Hyung Soon
    [J]. ETRI JOURNAL, 2010, 32 (05) : 801 - 809
  • [30] An engineering model of the masking for the noise-robust speech recognition
    Park, KY
    Lee, SY
    [J]. NEUROCOMPUTING, 2003, 52-4 : 615 - 620