Discriminant Spectrotemporal Features for Phoneme Recognition

被引：0

作者：

Mesgarani, Nima ^{[1
]}

Sivaram, G. S. V. S. ^{[1
]}

Nemala, Sridhar Krishna ^{[1
]}

Elhilali, Mounya ^{[1
]}

Hermansky, Hynek ^{[1
]}

机构：

[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA

来源：

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年

关键词：

phoneme recognition; spectrotemporal filters; data driven features; RECEPTIVE-FIELDS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose discriminant methods for deriving two-dimensional spectrotemporal features for phoneme recognition that are estimated to maximize the separation between the representations of phoneme classes. The linearity of the filters results in their intuitive interpretation enabling us to investigate the working principles of the system and to improve its performance by locating the sources of error. Two methods for the estimation of filters are proposed: Regularized Least Square (RLS) and Modified Linear Discriminant Analysis (MLDA). Both methods reach a comparable improvement over the baseline condition demonstrating the advantage of the discriminant spectrotemporal filters.

引用

页码：2947 / 2950

页数：4

共 50 条

[1] Discriminant neural predictive coding applied to phoneme recognition
Gas, B
Zarader, JL
Chavy, C
Chetouani, M
[J]. NEUROCOMPUTING, 2004, 56 (1-4) : 141 - 166
[2] COMPARISON OF MODULATION FEATURES FOR PHONEME RECOGNITION
Ganapathy, Sriram
Thomas, Samuel
Hermansky, Hynek
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5038 - 5041
[3] Phoneme recognition using wavelet based features
Farooq, O
Datta, S
[J]. INFORMATION SCIENCES, 2003, 150 (1-2) : 5 - 15
[4] Inertia Based Recognition of Daily Activities with ANNs and Spectrotemporal Features
Kilinc, Ozsel
Dalzell, Alexander
Uluturk, Ismail
Uysal, Ismail
[J]. 2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 733 - 738
[5] PHONEME RECOGNITION USING BOOSTED BINARY FEATURES
Roy, Anindya
Magimai-Doss, Mathew
Marcel, Sebastien
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4868 - 4871
[6] Face Recognition by Cognitive Discriminant Features
Firouzian, Iman
Firouzian, Nematallah
[J]. INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2020, 11 (01): : 7 - 20
[7] Modulation frequency features for phoneme recognition in noisy speech
Ganapathy, Sriram
Thomas, Samuel
Hermansky, Hynek
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (01): : EL8 - EL12
[8] LEARNING DYNAMIC FEATURES WITH NEURAL NETWORKS FOR PHONEME RECOGNITION
Zheng, Xin
Wu, Zhiyong
Meng, Helen
Cai, Lianhong
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[9] Robust speech detection based on phoneme recognition features
Mihelic, France
Zibert, Janez
[J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 455 - 462
[10] Minimum phoneme error based heteroscedastic linear discriminant analysis for speech recognition
Zhang, B
Matsoukas, S
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 925 - 928

← 1 2 3 4 5 →