Model-Based Wiener filter for noise robust speech recognition

被引:0
|
作者
Arakawa, Takayuki
Tsujikawa, Masanori
Isotani, Ryosuke
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a new approach for noise robust speech recognition, which integrates signal-processing-based spectral enhancement and statistical-model-based compensation. The proposed method, Model-Based Wiener filter (MBW), takes three steps to estimate clean speech signals from noisy speech signals, which are corrupted by various kinds of additive background noise. The first step is the well-known spectral subtraction (SS). Since the SS averagely subtracts noise components, the estimated speech signals often include distortion. In the second step, the distortion caused by SS is reduced using the minimum mean square error estimation for a Gaussian mixture model representing pre-trained knowledge of speech. In the final step, the Wiener filtering is performed with the decision-directed method. Experiments are conducted using the Aurora2-J (Japanese digit string) database. The results show that the proposed method performs as well as the ETSI advanced front-end in average and the variation range of the recognition accuracy according to the kind of noise is about one third, which demonstrates the robustness of the proposed method.
引用
收藏
页码:537 / 540
页数:4
相关论文
共 50 条
  • [31] Noise robust speech recognition with a switching linear dynamic model
    Droppo, J
    Acero, A
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 953 - 956
  • [32] Mel-wiener filter for Mel-LPC based speech recognition
    Islam, Md. Babul
    Yamamoto, Kazumasa
    Matsumoto, Hiroshi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (06): : 935 - 942
  • [33] A Dynamic Segment Based Statistical Derived PNN Model for Noise Robust Speech Recognition
    Junjea, Kapil
    [J]. 2015 THIRD INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2015, : 320 - 325
  • [34] A Novel Model Characteristics for Noise-Robust Automatic Speech Recognition Based on HMM
    Rafieee, M. Saadeq
    Khazaei, Ali Akbar
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND INFORMATION SECURITY (WCNIS), VOL 2, 2010, : 215 - 218
  • [35] Robust speaker recognition integrating pitch and wiener filter
    Bai, JM
    Zheng, R
    Xu, B
    Zhang, SW
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 69 - 72
  • [36] COMBINING SPECTRAL FEATURE MAPPING AND MULTI-CHANNEL MODEL-BASED SOURCE SEPARATION FOR NOISE-ROBUST AUTOMATIC SPEECH RECOGNITION
    Bagchi, Deblin
    Mandel, Michael I.
    Wang, Zhongqiu
    He, Yanzhang
    Plummer, Andrew
    Fosler-Lussier, Eric
    [J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 496 - 503
  • [37] Signal trajectory based noise compensation for robust speech recognition
    Yan, Zhi-Jie
    Zhou, Jian-Lai
    Soong, Frank
    Wang, Ren-Hua
    [J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 335 - +
  • [38] Assessment of signal subspace based speech enhancement for noise robust speech recognition
    Hermus, K
    Wambacq, P
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 945 - 948
  • [39] Frequency-domain criterion for the speech distortion weighted multichannel Wiener filter for robust noise reduction
    Doclo, Simon
    Spriet, Ann
    Wouters, Jan
    Moonen, Marc
    [J]. SPEECH COMMUNICATION, 2007, 49 (7-8) : 636 - 656
  • [40] Reverberation Model-Based Decoding in the Logmelspec Domain for Robust Distant-Talking Speech Recognition
    Sehr, Armin
    Maas, Roland
    Kellermann, Walter
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1676 - 1691