Two-stage noise spectra estimation and regression based in-car speech recognition using single distant microphone

被引：0

作者：

Li, WF ^{[1
]}

Itou, K ^{[1
]}

Takeda, K ^{[1
]}

Itakura, F ^{[1
]}

机构：

[1] Nagoya Univ, Grad Sch Engn, Nagoya, Aichi 4648603, Japan

来源：

2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a two-stage noise spectra estimation approach. After the first-stage noise estimation using the improved minima controlled recursive averaging (IMCRA) method, the second-stage noise estimation is performed by employing a maximum a posteriori (MAP) noise amplitude estimator. We also develop a regression-based speech enhance system by approximating the clean speech with the estimated noise and original noisy speech. Evaluation experiments show that the proposed two-stage noise estimation method results in lower estimation error for all test noise types. Compared to original noisy speech, the proposed regression-based approach obtains an average relative word error rate (WER) reduction of 65% in our isolated word recognition experiments conducted in 12 real car environments.

引用

页码：533 / 536

页数：4

共 50 条

[1] Improved noise spectra estimation and log-spectral regression for in-car speech recognition
Li, W. (lee@sp.m.is.nagoya-u.ac.jp), Information Processing Society of Japan, IPSJ; The Database Society of Japan, DBSJ; The IEEE Computer Society; The Inst. of Elec., Info. and Com. Engineers, IEICE (IEEE Computer Society):
[2] Adaptive regression based framework for in-car speech recognition
Li, Weifeng
Itou, Katunobu
Takeda, Kazuya
Itakura, Fumitada
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 501 - 504
[3] Multiple regression of log spectra for in-car speech recognition using multiple distributed microphones
Li, WF
Shinde, T
Fujimura, H
Miyajima, C
Nishino, T
Itou, K
Takeda, K
Itakura, F
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03) : 384 - 390
[4] AN FLMS BASED TWO-MICROPHONE SPEECH ENHANCEMENT SYSTEM FOR IN-CAR APPLICATIONS
Freudenberger, Juergen
Stenzel, Sebastian
Venditti, Benjamin
2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 704 - 707
[5] Adaptive nonlinear regression using multiple distributed microphones for in-car speech recognition
Li, WF
Miyajima, C
Nishino, T
Itou, K
Takeda, K
Itakura, F
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1716 - 1723
[6] A New Two-Stage Method for Single-Microphone Speech Dereverberation
Baghaki, Ali
Ahmad, M. Omair
Swamy, M. N. S.
2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 778 - 781
[7] MAGNITUDE OR PHASE? A TWO-STAGE ALGORITHM FOR SINGLE-MICROPHONE SPEECH DEREVERBERATION
Schwartz, Ayal
Gannot, Sharon
Ghazan, Shlomo E.
2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024, 2024, : 454 - 458
[8] Adaptive log-spectral regression for in-car speech recognition using multiple distributed microphones
Li, WF
Takeda, K
Itakura, F
IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (04) : 340 - 343
[9] Fast Noise Level Estimation Algorithm Based on Two-Stage Support Vector Regression
Xu S.
Zeng X.
Tang Y.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2018, 30 (03): : 447 - 458
[10] Two-Stage Enhancement of Noisy and Reverberant Microphone Array Speech for Automatic Speech Recognition Systems Trained with Only Clean Speech
Wang, Quandong
Wang, Sicheng
Ge, Fengpei
Han, Chang Woo
Lee, Jaewon
Guo, Lianghao
Lee, Chin-Hui
2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 21 - 25

← 1 2 3 4 5 →