Model-Based Wiener filter for noise robust speech recognition

被引：0

作者：

Arakawa, Takayuki

Tsujikawa, Masanori

Isotani, Ryosuke

机构：

来源：

2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13 | 2006年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we propose a new approach for noise robust speech recognition, which integrates signal-processing-based spectral enhancement and statistical-model-based compensation. The proposed method, Model-Based Wiener filter (MBW), takes three steps to estimate clean speech signals from noisy speech signals, which are corrupted by various kinds of additive background noise. The first step is the well-known spectral subtraction (SS). Since the SS averagely subtracts noise components, the estimated speech signals often include distortion. In the second step, the distortion caused by SS is reduced using the minimum mean square error estimation for a Gaussian mixture model representing pre-trained knowledge of speech. In the final step, the Wiener filtering is performed with the decision-directed method. Experiments are conducted using the Aurora2-J (Japanese digit string) database. The results show that the proposed method performs as well as the ETSI advanced front-end in average and the variation range of the recognition accuracy according to the kind of noise is about one third, which demonstrates the robustness of the proposed method.

引用

页码：537 / 540

页数：4

共 50 条

[21] Incomplete spectrogram reconstruction with kalman filter for noise robust speech recognition
Mohammadi, Arash
Almasganj, Farshad
Sadrieh, Nima
Zandi, Alireza
[J]. 2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 814 - +
[22] Unsupervised modulation filter learning for noise-robust speech recognition
Agrawal, Purvi
Ganapathy, Sriram
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (03): : 1686 - 1692
[23] GMM-Based two-stage mel-warped Wiener filter for robust speech recognition
Lei, Jianjun
Guo, Jun
Liu, Gang
Wang, Jian
Shen, Halfeng
Nie, Xiangfei
[J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 827 - 830
[24] A SPARSITY BASED PREPROCESSING FOR NOISE ROBUST SPEECH RECOGNITION
Koniaris, Christos
Chatterjee, Saikat
[J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 513 - 518
[25] NOISE IDENTIFICATION FOR MODEL-BASED SPEECH ENHANCEMENT
Jiang Wenbin
Ying Rendong
Liu Peilin
[J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 478 - 483
[26] Noise Reduction and Speech Enhancement Using Wiener Filter
Nuha, Hilal H.
Absa, Ahmad Abo
[J]. 2022 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ITS APPLICATIONS (ICODSA), 2022, : 177 - 180
[27] MODEL-BASED DEREVERBERATION IN THE LOGMELSPEC DOMAIN FOR ROBUST DISTANT-TALKING SPEECH RECOGNITION
Sehr, Armin
Maas, Roland
Kellermann, Walter
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4298 - 4301
[28] GEVD Based Speech and Noise Correlation Matrix Estimation for Multichannel Wiener Filter Based Noise Reduction
Van Rompaey, Robbe
Moonen, Marc
[J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2544 - 2548
[29] Statistical Model-Based Noise Reduction Approach for Car Interior Applications to Speech Recognition
Lee, Sung Joo
Kang, Byung Ok
Jung, Ho-Young
Lee, Yunkeun
Kim, Hyung Soon
[J]. ETRI JOURNAL, 2010, 32 (05) : 801 - 809
[30] An engineering model of the masking for the noise-robust speech recognition
Park, KY
Lee, SY
[J]. NEUROCOMPUTING, 2003, 52-4 : 615 - 620

← 1 2 3 4 5 →