A Feature Selection Method in Spectro-Temporal Domain Based on Gaussian Mixture Models

被引:0
|
作者
Esfandian, Nafiseh [1 ]
Razzazi, Farbod [2 ]
Behrad, Alireza [3 ]
Valipour, Sara [4 ]
机构
[1] Islamic Azad Univ, Qaemshahr Branch, Fac Engn, Qaemshahr, Iran
[2] Islamic Azad Univ, Fac Engn, Sci & Res Branch, Tehran, Iran
[3] Shahed Univ, Fac Engn, Tehran, Iran
[4] Islamic Azad Univ, Fac Engn, Arak, Iran
关键词
component; Speech recognition; Speech processing; auditory system; Feature extraction; Clustering methods; RECOGNITION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Spectra-temporal representation of speech is considered as one of the leading speech representation approaches in speech recognition systems in recent years. This representation is suffered from high dimensionality of the features space which makes this domain unusable in practical speech recognition systems. In this paper, a new method of feature selection is proposed in the spectro-temporal domain. In this method, clustering techniques are applied to spectro-temporal domain to reduce the dimensions of the features space. In the proposed approach, spectro-temporal space is clustered based on Gaussian Mixture Models (GMMs). The mean vectors and covariance matrices elements of the clusters are considered as a part of the feature vector of the frame. The tests were conducted for new feature vectors on voiced stops (/b/, /d/, /g/) classification of the TIMIT database. Using the new feature vectors, the results were improved to 70.45% which is 7.95% higher than last best results.
引用
收藏
页码:522 / +
页数:2
相关论文
共 50 条
  • [1] A clustering based feature selection method in spectro-temporal domain for speech recognition
    Esfandian, Nafiseh
    Razzazi, Farbod
    Behrad, Alireza
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2012, 25 (06) : 1194 - 1202
  • [2] A scale-rate filter selection method in the spectro-temporal domain for phoneme classification
    Fartash, Mehdi
    Setayeshi, Saeed
    Razzazi, Farbod
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2013, 39 (05) : 1537 - 1548
  • [3] Music-Genre Classification System based on Spectro-Temporal Features Feature Selection
    Lim, Shin-Cheol
    Lee, Jong-Seol
    Jang, Sei-Jin
    Lee, Soek-Pil
    Kim, Moo Young
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (04) : 1262 - 1268
  • [4] A Novel Spectro-Temporal Feature Extraction Method for Phoneme Classification
    Fartash, Mehdi
    Setayeshi, Saeed
    Razzazi, Farbod
    [J]. 2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 569 - +
  • [5] A hierarchical framework for spectro-temporal feature extraction
    Heckmann, Martin
    Domont, Xavier
    Joublin, Frank
    Goerick, Christian
    [J]. SPEECH COMMUNICATION, 2011, 53 (05) : 736 - 752
  • [6] ROBUST SPECTRO-TEMPORAL FEATURES BASED ON AUTOREGRESSIVE MODELS OF HILBERT ENVELOPES
    Ganapathy, Sriram
    Thomas, Samuel
    Hermansky, Hynek
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4286 - 4289
  • [7] Spectro-temporal processing in the envelope-frequency domain
    Ewert, SD
    Verhey, JL
    Dau, T
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 112 (06): : 2921 - 2931
  • [8] A Tunable picosecond dye laser based on cavity quenching and spectro-temporal selection
    D.Q. Hoa
    N.D. Hung
    T. Imasaka
    N.T. Thanh
    [J]. Applied Physics B, 2004, 79 : 463 - 467
  • [9] Picosecond solid-state dye lasers based on a spectro-temporal selection
    Dao, TTA
    Hoa, DQ
    Nhung, TH
    Ha, TT
    Hung, ND
    [J]. OPTICS FOR THE QUALITY OF LIFE, PTS 1 AND 2, 2003, 4829 : 694 - 696
  • [10] A Tunable picosecond dye laser based on cavity quenching and spectro-temporal selection
    Hoa, DQ
    Hung, ND
    Imasaka, T
    Thanh, NT
    [J]. APPLIED PHYSICS B-LASERS AND OPTICS, 2004, 79 (04): : 463 - 467