A Feature Selection Method in Spectro-Temporal Domain Based on Gaussian Mixture Models

被引:0
|
作者
Esfandian, Nafiseh [1 ]
Razzazi, Farbod [2 ]
Behrad, Alireza [3 ]
Valipour, Sara [4 ]
机构
[1] Islamic Azad Univ, Qaemshahr Branch, Fac Engn, Qaemshahr, Iran
[2] Islamic Azad Univ, Fac Engn, Sci & Res Branch, Tehran, Iran
[3] Shahed Univ, Fac Engn, Tehran, Iran
[4] Islamic Azad Univ, Fac Engn, Arak, Iran
关键词
component; Speech recognition; Speech processing; auditory system; Feature extraction; Clustering methods; RECOGNITION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Spectra-temporal representation of speech is considered as one of the leading speech representation approaches in speech recognition systems in recent years. This representation is suffered from high dimensionality of the features space which makes this domain unusable in practical speech recognition systems. In this paper, a new method of feature selection is proposed in the spectro-temporal domain. In this method, clustering techniques are applied to spectro-temporal domain to reduce the dimensions of the features space. In the proposed approach, spectro-temporal space is clustered based on Gaussian Mixture Models (GMMs). The mean vectors and covariance matrices elements of the clusters are considered as a part of the feature vector of the frame. The tests were conducted for new feature vectors on voiced stops (/b/, /d/, /g/) classification of the TIMIT database. Using the new feature vectors, the results were improved to 70.45% which is 7.95% higher than last best results.
引用
收藏
页码:522 / +
页数:2
相关论文
共 50 条
  • [21] Pre-attentive Spectro-temporal Feature Processing in the Human Auditory System
    Zaehle, Tino
    Jancke, Lutz
    Herrmann, Christoph S.
    Meyer, Martin
    [J]. BRAIN TOPOGRAPHY, 2009, 22 (02) : 97 - 108
  • [22] Pre-attentive Spectro-temporal Feature Processing in the Human Auditory System
    Tino Zaehle
    Lutz Jancke
    Christoph S. Herrmann
    Martin Meyer
    [J]. Brain Topography, 2009, 22 : 97 - 108
  • [23] A survey of feature selection methods for Gaussian mixture models and hidden Markov models
    Adams, Stephen
    Beling, Peter A.
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (03) : 1739 - 1779
  • [24] Feature Adaptation Using Linear Spectro-Temporal Transform for Robust Speech Recognition
    Duc Hoang Ha Nguyen
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (06) : 1006 - 1019
  • [25] New approaches for spectro-temporal feature extraction with applications to respiratory sound classification
    Jin, F.
    Sattar, F.
    Goh, D. Y. T.
    [J]. NEUROCOMPUTING, 2014, 123 : 362 - 371
  • [26] A survey of feature selection methods for Gaussian mixture models and hidden Markov models
    Stephen Adams
    Peter A. Beling
    [J]. Artificial Intelligence Review, 2019, 52 : 1739 - 1779
  • [27] POINT PROCESS MODELS OF SPECTRO-TEMPORAL MODULATION EVENTS FOR SPEECH RECOGNITION
    Jansen, Aren
    Mesgarani, Nima
    Niyogi, Partha
    [J]. 2010 CONFERENCE RECORD OF THE FORTY FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2010, : 104 - 108
  • [28] A voice activity detection algorithm in spectro-temporal domain using sparse representation
    Mohadese Eshaghi
    Farbod Razzazi
    Alireza Behrad
    [J]. International Journal of Machine Learning and Cybernetics, 2019, 10 : 1791 - 1803
  • [29] Spectro-Temporal Feature Based Multi-Channel Convolutional Neural Network for ECG Beat Classification
    Hao, Chen
    Wibowo, Sandi
    Majmudar, Maulik
    Rajput, Kuldeep Singh
    [J]. 2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 5642 - 5645
  • [30] Incorporating behavioral and sensory context into spectro-temporal models of auditory encoding
    David, Stephen V.
    [J]. HEARING RESEARCH, 2018, 360 : 107 - 123