A Feature Selection Method in Spectro-Temporal Domain Based on Gaussian Mixture Models

被引:0
|
作者
Esfandian, Nafiseh [1 ]
Razzazi, Farbod [2 ]
Behrad, Alireza [3 ]
Valipour, Sara [4 ]
机构
[1] Islamic Azad Univ, Qaemshahr Branch, Fac Engn, Qaemshahr, Iran
[2] Islamic Azad Univ, Fac Engn, Sci & Res Branch, Tehran, Iran
[3] Shahed Univ, Fac Engn, Tehran, Iran
[4] Islamic Azad Univ, Fac Engn, Arak, Iran
关键词
component; Speech recognition; Speech processing; auditory system; Feature extraction; Clustering methods; RECOGNITION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Spectra-temporal representation of speech is considered as one of the leading speech representation approaches in speech recognition systems in recent years. This representation is suffered from high dimensionality of the features space which makes this domain unusable in practical speech recognition systems. In this paper, a new method of feature selection is proposed in the spectro-temporal domain. In this method, clustering techniques are applied to spectro-temporal domain to reduce the dimensions of the features space. In the proposed approach, spectro-temporal space is clustered based on Gaussian Mixture Models (GMMs). The mean vectors and covariance matrices elements of the clusters are considered as a part of the feature vector of the frame. The tests were conducted for new feature vectors on voiced stops (/b/, /d/, /g/) classification of the TIMIT database. Using the new feature vectors, the results were improved to 70.45% which is 7.95% higher than last best results.
引用
收藏
页码:522 / +
页数:2
相关论文
共 50 条
  • [41] Feature extraction via spectro-temporal analysis of hyperspectral data for vegetative target detection
    Mathur, A
    Bruce, LM
    Robles, W
    Madsen, J
    [J]. 2005 International Workshop on the Analysis on Multi-Temporal Remote Sensing Images, 2005, : 64 - 66
  • [42] Intelligibility assessment of cleft lip and palate speech using Gaussian posteriograms based on joint spectro-temporal features
    Kalita, Sishir
    Prasanna, S. R. Mahadeva
    Dandapat, S.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 144 (04): : 2413 - 2423
  • [43] A clustering-based approach for features extraction in spectro-temporal domain using artificial neural network
    Esfandian, N.
    Hosseinpour, K.
    [J]. International Journal of Engineering, Transactions B: Applications, 2021, 34 (02): : 452 - 457
  • [44] Designing of Gabor filters for spectro-temporal feature extraction to improve the performance of ASR system
    Anirban Dutta
    Gudmalwar Ashishkumar
    Ch. V. Rama Rao
    [J]. International Journal of Speech Technology, 2019, 22 : 1085 - 1097
  • [45] Designing of Gabor filters for spectro-temporal feature extraction to improve the performance of ASR system
    Dutta, Anirban
    Ashishkumar, Gudmalwar
    Rao, Ch V. Rama
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (04) : 1085 - 1097
  • [46] Spectro-temporal modulation energy based mask for robust speaker identification
    Chi, Tai-Shih
    Lin, Ting-Han
    Hsu, Chung-Chien
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (05): : EL368 - EL374
  • [47] Facial feature points detecting based on Gaussian Mixture Models
    Wang, Junnan
    Xiong, Rong
    Chu, Jian
    [J]. PATTERN RECOGNITION LETTERS, 2015, 53 : 62 - 68
  • [48] BLIND ESTIMATION OF REVERBERATION TIME BASED ON SPECTRO-TEMPORAL MODULATION FILTERING
    Xiong, Feifei
    Goetze, Stefan
    Meyer, Bernd T.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 443 - 447
  • [49] Fast Forward Feature Selection of Hyperspectral Images for Classification With Gaussian Mixture Models
    Fauvel, Mathieu
    Dechesne, Clement
    Zullo, Anthony
    Ferraty, Frederic
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2015, 8 (06) : 2824 - 2831
  • [50] Feature selection for pattern classification with Gaussian mixture models: A new objective criterion
    Krishnan, S
    Samudravijaya, K
    Rao, PVS
    [J]. PATTERN RECOGNITION LETTERS, 1996, 17 (08) : 803 - 809