Discriminant Spectrotemporal Features for Phoneme Recognition

被引:0
|
作者
Mesgarani, Nima [1 ]
Sivaram, G. S. V. S. [1 ]
Nemala, Sridhar Krishna [1 ]
Elhilali, Mounya [1 ]
Hermansky, Hynek [1 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
关键词
phoneme recognition; spectrotemporal filters; data driven features; RECEPTIVE-FIELDS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose discriminant methods for deriving two-dimensional spectrotemporal features for phoneme recognition that are estimated to maximize the separation between the representations of phoneme classes. The linearity of the filters results in their intuitive interpretation enabling us to investigate the working principles of the system and to improve its performance by locating the sources of error. Two methods for the estimation of filters are proposed: Regularized Least Square (RLS) and Modified Linear Discriminant Analysis (MLDA). Both methods reach a comparable improvement over the baseline condition demonstrating the advantage of the discriminant spectrotemporal filters.
引用
收藏
页码:2947 / 2950
页数:4
相关论文
共 50 条
  • [1] Discriminant neural predictive coding applied to phoneme recognition
    Gas, B
    Zarader, JL
    Chavy, C
    Chetouani, M
    [J]. NEUROCOMPUTING, 2004, 56 (1-4) : 141 - 166
  • [2] COMPARISON OF MODULATION FEATURES FOR PHONEME RECOGNITION
    Ganapathy, Sriram
    Thomas, Samuel
    Hermansky, Hynek
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5038 - 5041
  • [3] Phoneme recognition using wavelet based features
    Farooq, O
    Datta, S
    [J]. INFORMATION SCIENCES, 2003, 150 (1-2) : 5 - 15
  • [4] Inertia Based Recognition of Daily Activities with ANNs and Spectrotemporal Features
    Kilinc, Ozsel
    Dalzell, Alexander
    Uluturk, Ismail
    Uysal, Ismail
    [J]. 2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 733 - 738
  • [5] PHONEME RECOGNITION USING BOOSTED BINARY FEATURES
    Roy, Anindya
    Magimai-Doss, Mathew
    Marcel, Sebastien
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4868 - 4871
  • [6] Face Recognition by Cognitive Discriminant Features
    Firouzian, Iman
    Firouzian, Nematallah
    [J]. INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2020, 11 (01): : 7 - 20
  • [7] Modulation frequency features for phoneme recognition in noisy speech
    Ganapathy, Sriram
    Thomas, Samuel
    Hermansky, Hynek
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (01): : EL8 - EL12
  • [8] LEARNING DYNAMIC FEATURES WITH NEURAL NETWORKS FOR PHONEME RECOGNITION
    Zheng, Xin
    Wu, Zhiyong
    Meng, Helen
    Cai, Lianhong
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [9] Robust speech detection based on phoneme recognition features
    Mihelic, France
    Zibert, Janez
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 455 - 462
  • [10] Minimum phoneme error based heteroscedastic linear discriminant analysis for speech recognition
    Zhang, B
    Matsoukas, S
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 925 - 928