Model-Based Wiener filter for noise robust speech recognition

Cited by: 0

Authors:

Arakawa, Takayuki

Tsujikawa, Masanori

Isotani, Ryosuke

Source:

2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13 | 2006

DOI:

Not available

Chinese Library Classification (CLC):

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

In this paper, we propose a new approach to noise-robust speech recognition that integrates signal-processing-based spectral enhancement with statistical-model-based compensation. The proposed method, the Model-Based Wiener filter (MBW), estimates clean speech signals from noisy speech signals corrupted by various kinds of additive background noise, and does so in three steps. The first step is the well-known spectral subtraction (SS). Because SS subtracts an averaged estimate of the noise components, the estimated speech signals often contain distortion. In the second step, the distortion caused by SS is reduced by minimum mean square error (MMSE) estimation with a Gaussian mixture model (GMM) that represents pre-trained knowledge of speech. In the final step, Wiener filtering is performed with the decision-directed method. Experiments are conducted on the Aurora2-J (Japanese digit string) database. The results show that the proposed method performs as well as the ETSI advanced front-end on average, and that the range over which recognition accuracy varies with the kind of noise is about one third, which demonstrates the robustness of the proposed method.
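The three steps named in the abstract can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the authors' implementation: the abstract gives no equations, so the function names, the spectral floor `floor`, the smoothing constant `alpha`, the use of a diagonal-covariance GMM over log-power spectra, and the way the GMM estimate feeds the a priori SNR are all assumptions made here for illustration.

```python
import numpy as np

def spectral_subtraction(noisy_power, noise_power, floor=0.01):
    """Step 1: subtract an average noise power estimate, with a spectral floor."""
    return np.maximum(noisy_power - noise_power, floor * noisy_power)

def gmm_mmse_estimate(log_power, weights, means, variances):
    """Step 2 (assumed form): MMSE estimate as the posterior-weighted mean
    of a diagonal-covariance GMM trained on clean-speech log-power spectra."""
    diff = log_power - means                                  # shape (K, D)
    log_lik = -0.5 * np.sum(np.log(2.0 * np.pi * variances)
                            + diff ** 2 / variances, axis=1)  # shape (K,)
    log_post = np.log(weights) + log_lik
    log_post -= log_post.max()          # stabilise before exponentiating
    post = np.exp(log_post)
    post /= post.sum()                  # component posteriors
    return post @ means                 # shape (D,)

def wiener_decision_directed(noisy_power, noise_power, speech_power, alpha=0.98):
    """Step 3: Wiener gain from a decision-directed a priori SNR.
    Assumption: the model-based estimate `speech_power` supplies the
    current-frame term, the previous enhanced frame the recursive term."""
    noisy_power = np.asarray(noisy_power, dtype=float)
    enhanced = np.empty_like(noisy_power)
    prev = speech_power[0]
    for t in range(noisy_power.shape[0]):
        xi = (alpha * prev + (1.0 - alpha) * speech_power[t]) / noise_power
        gain = xi / (1.0 + xi)          # Wiener gain, always in [0, 1)
        enhanced[t] = gain ** 2 * noisy_power[t]
        prev = enhanced[t]
    return enhanced

def mbw_sketch(noisy_power, noise_power, weights, means, variances):
    """Chain the three steps over frames of shape (T, D); inputs must be > 0."""
    ss = spectral_subtraction(noisy_power, noise_power)
    model = np.exp(np.stack([
        gmm_mmse_estimate(np.log(frame), weights, means, variances)
        for frame in ss
    ]))
    return wiener_decision_directed(noisy_power, noise_power, model)
```

Here `noisy_power` is a `(T, D)` array of per-frame power spectra, `noise_power` a `(D,)` average noise estimate, and `weights`, `means`, `variances` the parameters of a hypothetical pre-trained GMM with `K` components. The floor in step 1 keeps the subtracted spectrum positive so that its logarithm can be passed to step 2.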


Pages: 537-540

Page count: 4
