Feature extraction using non-linear transformation for robust speech recognition on the AURORA database

被引：0

作者：

Sharma, S ^{[1
]}

Ellis, D ^{[1
]}

Kajarekar, S ^{[1
]}

Jain, P ^{[1
]}

Hermansky, H ^{[1
]}

机构：

[1] Intel Corp, Santa Clara, CA 95051 USA

来源：

2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI | 2000年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We evaluate the performance of several feature sets on the AURORA task as defined by ETSI. We show that after a non-linear transformation, a number of features can be effectively used in a HMM-based recognition system. The non-linear transformation is computed using a neural network which is discriminatively trained on the phonetically labeled (forcibly aligned) training data. A combination of the non-linearly transformed PLP, MSG and TRAP features yields a 63% improvement in error rate as compared to a baseline MFCC features. The use of the non-linearly transformed RASTA-like features, with system parameters scaled down to take into account the ETSI imposed memory and latency constraints, still yields a 40% improvement in error rate.

引用

页码：1117 / 1120

页数：4

共 50 条

[41] Robust Feature Extraction Methods for Speech Recognition in Noisy Environments
Mukheolkar, Ajinkya Sunil
Alex, John Sahaya Rani
2014 FIRST INTERNATIONAL CONFERENCE ON NETWORKS & SOFT COMPUTING (ICNSC), 2014, : 295 - 299
[42] A bio-inspired feature extraction for robust speech recognition
Zouhir, Youssef
Ouni, Kais
SPRINGERPLUS, 2014, 3
[43] Temporal modulation normalization for robust speech feature extraction and recognition
Lu, Xugang
Matsuda, Shigeki
Unoki, Masashi
Nakamura, Satoshi
MULTIMEDIA TOOLS AND APPLICATIONS, 2011, 52 (01) : 187 - 199
[44] Temporal modulation normalization for robust speech feature extraction and recognition
Xugang Lu
Shigeki Matsuda
Masashi Unoki
Satoshi Nakamura
Multimedia Tools and Applications, 2011, 52 : 187 - 199
[45] Physiologically Motivated Feature Extraction for Robust Automatic Speech Recognition
Missaoui, Ibrahim
Lachiri, Zied
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (04) : 297 - 301
[46] A Correlational Discriminant Approach to Feature Extraction for Robust Speech Recognition
Tomar, Vikrant Singh
Rose, Richard C.
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 554 - 557
[47] Feature extraction based on auditory representations for robust speech recognition
Kim, DS
Lee, SY
Kil, RM
Zhu, XL
ELECTRONICS LETTERS, 1997, 33 (01) : 15 - 16
[48] An auditory neural feature extraction method for robust speech recognition
Guo, Wei
Zhang, Liqing
Xia, Bin
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 793 - +
[49] A robust feature extraction for automatic speech recognition in noisy environments
Lima, C
Almeida, LB
Monteiro, JL
2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 540 - 543
[50] Temporal modulation normalization for robust speech feature extraction and recognition
Lu, Xugang
Matsuda, Shigeki
Unoki, Masashi
Nakamura, Satoshi
PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4354 - 4357

← 1 2 3 4 5 →