Two-stage noise spectra estimation and regression based in-car speech recognition using single distant microphone

Cited by: 0
Authors
Li, WF [1]
Itou, K [1]
Takeda, K [1]
Itakura, F [1]
Affiliations
[1] Nagoya Univ, Grad Sch Engn, Nagoya, Aichi 4648603, Japan
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we present a two-stage noise spectra estimation approach. After the first-stage noise estimation using the improved minima controlled recursive averaging (IMCRA) method, the second-stage noise estimation is performed by employing a maximum a posteriori (MAP) noise amplitude estimator. We also develop a regression-based speech enhancement system that approximates the clean speech from the estimated noise and the original noisy speech. Evaluation experiments show that the proposed two-stage noise estimation method yields lower estimation error for all tested noise types. Compared to the original noisy speech, the proposed regression-based approach obtains an average relative word error rate (WER) reduction of 65% in our isolated word recognition experiments conducted in 12 real car environments.
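The regression idea in the abstract (approximating clean speech from the estimated noise and the noisy speech) can be illustrated with a minimal sketch. It assumes a per-frequency-bin linear least-squares regression in the log-spectral domain, fitted on synthetic data; the abstract does not specify the regression model, and the IMCRA and MAP noise estimators are not reproduced here. The function names `fit_log_spectral_regression` and `apply_log_spectral_regression` are hypothetical.

```python
import numpy as np


def fit_log_spectral_regression(noisy_log, noise_log, clean_log):
    """Fit clean ~= a*noisy + b*noise + c independently for each frequency bin.

    All inputs are (n_frames, n_bins) arrays of log spectra.
    Returns an (n_bins, 3) array holding [a, b, c] per bin.
    NOTE: a linear model is an assumption made for illustration only.
    """
    n_frames, n_bins = noisy_log.shape
    weights = np.empty((n_bins, 3))
    for k in range(n_bins):
        # Design matrix: noisy log spectrum, estimated noise log spectrum, bias.
        X = np.column_stack([noisy_log[:, k], noise_log[:, k], np.ones(n_frames)])
        weights[k], *_ = np.linalg.lstsq(X, clean_log[:, k], rcond=None)
    return weights


def apply_log_spectral_regression(weights, noisy_log, noise_log):
    """Estimate clean log spectra from noisy and estimated-noise log spectra."""
    return weights[:, 0] * noisy_log + weights[:, 1] * noise_log + weights[:, 2]


# Synthetic stand-in data (hypothetical, not from the paper): power spectra add.
rng = np.random.default_rng(0)
clean_power = rng.uniform(0.5, 2.0, size=(500, 8))
noise_power = rng.uniform(0.1, 1.0, size=(500, 8))
noisy_log = np.log(clean_power + noise_power)
clean_log = np.log(clean_power)
# Pretend the noise estimator is imperfect by perturbing the true noise.
noise_log = np.log(noise_power) + rng.normal(0.0, 0.1, size=noise_power.shape)

weights = fit_log_spectral_regression(noisy_log, noise_log, clean_log)
estimate = apply_log_spectral_regression(weights, noisy_log, noise_log)

print("MSE, noisy vs clean:   ", np.mean((noisy_log - clean_log) ** 2))
print("MSE, estimate vs clean:", np.mean((estimate - clean_log) ** 2))
```

On the training data, the fitted estimate cannot have a larger squared error than the unprocessed noisy log spectrum, because the identity mapping (a = 1, b = 0, c = 0) is itself one of the candidate solutions. How well such a mapping generalizes to unseen data depends on the quality of the noise estimate, which is the role of the two-stage estimator described above.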
Pages: 533-536
Page count: 4
Related papers
50 in total
  • [1] Improved noise spectra estimation and log-spectral regression for in-car speech recognition
    Li, W. (lee@sp.m.is.nagoya-u.ac.jp), Information Processing Society of Japan, IPSJ; The Database Society of Japan, DBSJ; The IEEE Computer Society; The Inst. of Elec., Info. and Com. Engineers, IEICE (IEEE Computer Society):
  • [2] Adaptive regression based framework for in-car speech recognition
    Li, Weifeng
    Itou, Katunobu
    Takeda, Kazuya
    Itakura, Fumitada
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 501 - 504
  • [3] Multiple regression of log spectra for in-car speech recognition using multiple distributed microphones
    Li, WF
    Shinde, T
    Fujimura, H
    Miyajima, C
    Nishino, T
    Itou, K
    Takeda, K
    Itakura, F
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03) : 384 - 390
  • [4] AN FLMS BASED TWO-MICROPHONE SPEECH ENHANCEMENT SYSTEM FOR IN-CAR APPLICATIONS
    Freudenberger, Juergen
    Stenzel, Sebastian
    Venditti, Benjamin
    2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 704 - 707
  • [5] Adaptive nonlinear regression using multiple distributed microphones for in-car speech recognition
    Li, WF
    Miyajima, C
    Nishino, T
    Itou, K
    Takeda, K
    Itakura, F
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1716 - 1723
  • [6] A New Two-Stage Method for Single-Microphone Speech Dereverberation
    Baghaki, Ali
    Ahmad, M. Omair
    Swamy, M. N. S.
    2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 778 - 781
  • [7] MAGNITUDE OR PHASE? A TWO-STAGE ALGORITHM FOR SINGLE-MICROPHONE SPEECH DEREVERBERATION
    Schwartz, Ayal
    Gannot, Sharon
    Ghazan, Shlomo E.
    2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024, 2024, : 454 - 458
  • [8] Adaptive log-spectral regression for in-car speech recognition using multiple distributed microphones
    Li, WF
    Takeda, K
    Itakura, F
    IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (04) : 340 - 343
  • [9] Fast Noise Level Estimation Algorithm Based on Two-Stage Support Vector Regression
    Xu S.
    Zeng X.
    Tang Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2018, 30 (03) : 447 - 458
  • [10] Two-Stage Enhancement of Noisy and Reverberant Microphone Array Speech for Automatic Speech Recognition Systems Trained with Only Clean Speech
    Wang, Quandong
    Wang, Sicheng
    Ge, Fengpei
    Han, Chang Woo
    Lee, Jaewon
    Guo, Lianghao
    Lee, Chin-Hui
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 21 - 25