Feature extraction using non-linear transformation for robust speech recognition on the AURORA database

Cited by: 0

Authors:

Sharma, S [1]

Ellis, D [1]

Kajarekar, S [1]

Jain, P [1]

Hermansky, H [1]

Affiliation:

[1] Intel Corp, Santa Clara, CA 95051 USA

Source:

2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI | 2000

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

We evaluate the performance of several feature sets on the AURORA task as defined by ETSI. We show that after a non-linear transformation, a number of features can be effectively used in an HMM-based recognition system. The non-linear transformation is computed using a neural network which is discriminatively trained on the phonetically labeled (forcibly aligned) training data. A combination of the non-linearly transformed PLP, MSG and TRAP features yields a 63% improvement in error rate compared to baseline MFCC features. The use of the non-linearly transformed RASTA-like features, with system parameters scaled down to take into account the ETSI-imposed memory and latency constraints, still yields a 40% improvement in error rate.


Pages: 1117 - 1120

Page count: 4

50 items in total

[31] Robust Feature Combination for Speech Recognition Using Linear Microphone Array in a Car
Obuchi, Yasunari
Hataoka, Nobuo
IN-VEHICLE CORPUS AND SIGNAL PROCESSING FOR DRIVER BEHAVIOR, 2009, : 187 - +
[32] Noisy speech feature estimation on the Aurora2 database using a switching linear dynamic model
Deng, Jianping
Bouchard, Martin
Yeap, Tet Hin
2007, Academy Publisher (02):
[33] A NON-LINEAR OPERATOR BASED METHOD FOR HARMONIC FEATURE EXTRACTION FROM SPEECH SIGNALS
Kavanagh, Darren F.
Boland, Frank
ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2007, : 217 - 220
[34] Linear spectral transformation for robust speech recognition using maximum mutual information
Kim, Donghyun
Yook, Dongsuk
IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (07) : 496 - 499
[36] Robust speech processing using local adaptive non-linear filtering
Diaz-Ramirez, Victor H.
Kober, Vitaly
IET SIGNAL PROCESSING, 2013, 7 (05) : 345 - 359
[37] UNSEEN NOISE ROBUST SPEECH RECOGNITION USING ADAPTIVE PIECEWISE LINEAR TRANSFORMATION
Chijiiwa, Keigo
Suzuki, Masayuki
Minematsu, Nobuaki
Hirose, Keikichi
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4289 - 4292
[39] Non-linear predictors based on the functionally expanded neural networks for speech feature extraction
Chetouani, Mohamed
Hussain, Amir
Gas, Bruno
Zarader, Jean-Luc
2006 IEEE INTERNATIONAL CONFERENCE ON ENGINEERING OF INTELLIGENT SYSTEMS, 2006, : 1 - +
[40] Radial projections for non-linear feature extraction
Perez-Jimenez, AJ
Perez-Cortes, JC
16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 444 - 447
