Adaptive channel normalization based on infomax algorithm for robust speech recognition

Cited by: 1

Authors:

Jung, Ho-Young [1]

Affiliation:

[1] ETRI, Embedded SW Res Div, Taejon, South Korea

Source:

ETRI JOURNAL | 2007 / Vol. 29 / No. 03

Keywords:

robust speech recognition; adaptive channel normalization; RASTA-like filtering; blind decorrelation; information-maximization method;

DOI:

10.4218/etrij.07.0506.0031

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

This paper proposes a new data-driven high-pass approach that suppresses slowly varying noise components. Conventional high-pass approaches are based on the idea of decorrelating the feature vector sequence, and they strive for adaptability to various conditions. The proposed method performs temporal local decorrelation using information-maximization theory. It operates on an utterance-by-utterance basis, which yields an adaptive channel normalization filter for each condition. The method is evaluated in isolated-word recognition experiments with channel distortion. The experimental results show that it substantially improves recognition of channel-distorted speech.

Pages: 300-304

Page count: 5

50 items in total

[1] A Robust Feature Normalization Algorithm for Automatic Speech Recognition
Lei, Jianjun
Yang, Zhen
Wang, Jian
[J]. FIRST IITA INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 473 - +
[2] Adaptive ARMA filtering and energy normalization for robust speech recognition
Golshan, F.
Ahadi, S. M.
Shariati, S. S.
[J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 1059 - 1062
[3] Power Function-Based Power Distribution Normalization Algorithm for Robust Speech Recognition
Kim, Chanwoo
Stern, Richard M.
[J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 188 - +
[4] Double Gaussian based feature normalization for robust speech recognition
Liu, B
Dai, LR
Li, JY
Wang, RH
[J]. 2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 253 - 256
[5] Normalization of the Speech Modulation Spectra for Robust Speech Recognition
Xiao, Xiong
Chng, Eng Siong
Li, Haizhou
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1662 - 1674
[6] Adaptive Speaker Normalization for CTC-Based Speech Recognition
Ding, Fenglin
Guo, Wu
Gu, Bin
Ling, Zhenhua
Du, Jun
[J]. INTERSPEECH 2020, 2020, : 1266 - 1270
[7] SNR-normalization for robust speech recognition
Claes, T
VanCompernolle, D
[J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 331 - 334
[8] Cepstral vector normalization based on stereo data for robust speech recognition
Buera, Luis
Lleida, Eduardo
Miguel, Antonio
Ortega, Alfonso
Saz, Oscar
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 1098 - 1113
[9] Temporal structure normalization of speech feature for robust speech recognition
Xiao, Xiong
Chng, Eng Siong
Li, Haizhou
[J]. IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (07) : 500 - 503
[10] Robust speech recognition with multi-channel codebook dependent cepstral normalization (MCDCN)
Deligne, S
Gopinath, R
[J]. ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 151 - 154
