A scale-rate filter selection method in the spectro-temporal domain for phoneme classification

Cited by: 2
Authors
Fartash, Mehdi [1]
Setayeshi, Saeed [2, 3]
Razzazi, Farbod [1]
Affiliations
[1] Islamic Azad Univ, Dept Elect & Comp Engn, Sci & Res Branch, Tehran, Iran
[2] Amirkabir Univ Technol, Dept Radiat Med, Tehran, Iran
[3] Amirkabir Univ Technol, Tehran Polytech, Dept Med Radiat Engn, Tehran, Iran
Keywords
RECEPTIVE-FIELDS; SPEECH; REPRESENTATIONS;
DOI
10.1016/j.compeleceng.2012.12.013
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Recently, there has been a significant increase in studies employing auditory models in speech recognition systems. In this paper, we propose a new evolutionarily tuned feature extraction method based on spectro-temporal analysis. In the proposed model, each phoneme has its own subspace, defined by a specific best scale for the spectral filter and a specific best rate for the temporal filter. These two parameters are obtained by a genetic cellular automata evolutionary algorithm. The features extracted from each phoneme-specific subspace are classified by a binary one-versus-rest support vector machine. Finally, a multiclass classifier for all phonemes is built by combining these sub-models. The proposed method significantly improves the discrimination of phonemes, especially for highly confusable phonemes. To show the efficiency of the proposed feature sets, they were compared empirically with two baseline models. The achieved relative improvements in classification rate are about 10% for voiced plosives, unvoiced plosives and nasals, and about 7.38% for front vowels, relative to the state-of-the-art baseline model. © 2012 Elsevier Ltd. All rights reserved.
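The combination step described in the abstract can be illustrated with a minimal sketch. It assumes that the per-phoneme binary sub-models are combined by picking the phoneme whose one-versus-rest score is highest; the abstract only says the sub-models are combined, so this rule is an assumption. The names `classify`, `toy_extract`, `subspaces` and `scorers`, the (scale, rate) values, and the linear scoring functions are all hypothetical placeholders, standing in for the real spectro-temporal filters and trained support vector machines.

```python
from typing import Callable, Dict, Sequence, Tuple

Features = Sequence[float]
# Hypothetical interfaces (not from the paper):
#   an extractor maps (signal, scale, rate) to a feature vector,
#   a scorer maps a feature vector to a one-versus-rest decision score.
Extractor = Callable[[Sequence[float], float, float], Features]
Scorer = Callable[[Features], float]


def classify(
    signal: Sequence[float],
    extract: Extractor,
    subspaces: Dict[str, Tuple[float, float]],
    scorers: Dict[str, Scorer],
) -> str:
    """Return the phoneme whose binary sub-model gives the highest score.

    Each phoneme is scored on features extracted with its own
    (scale, rate) pair. Taking the argmax is an assumed combination rule.
    """
    best_label, best_score = "", float("-inf")
    for label, (scale, rate) in subspaces.items():
        feats = extract(signal, scale, rate)
        score = scorers[label](feats)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def toy_extract(signal: Sequence[float], scale: float, rate: float) -> Features:
    """Placeholder extractor; a real one would apply scale-rate filters."""
    return [scale * sum(signal), rate * len(signal)]


# Made-up per-phoneme (scale, rate) pairs; in the paper these are found
# by an evolutionary search, which is not reproduced here.
subspaces = {"b": (0.5, 4.0), "m": (2.0, 8.0)}

# Made-up linear scorers standing in for trained binary SVMs.
scorers = {
    "b": lambda f: f[0] - 0.5 * f[1],
    "m": lambda f: 0.5 * f[0] - 0.1 * f[1],
}

predicted = classify([1.0, 2.0, 3.0], toy_extract, subspaces, scorers)
print(predicted)
```

The point of the sketch is only the structure: every phoneme sees the input through its own filter settings before its binary decision is compared with the others.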
Pages: 1537-1548
Number of pages: 12
Related papers
50 items in total
  • [1] A Novel Spectro-Temporal Feature Extraction Method for Phoneme Classification
    Fartash, Mehdi
    Setayeshi, Saeed
    Razzazi, Farbod
    [J]. 2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 569 - +
  • [2] Phoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain
    Esfandian, N.
    [J]. INTERNATIONAL JOURNAL OF ENGINEERING, 2020, 33 (01): : 105 - 111
  • [3] A Feature Selection Method in Spectro-Temporal Domain Based on Gaussian Mixture Models
    Esfandian, Nafiseh
    Razzazi, Farbod
    Behrad, Alireza
    Valipour, Sara
    [J]. 2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 522 - +
  • [4] A clustering based feature selection method in spectro-temporal domain for speech recognition
    Esfandian, Nafiseh
    Razzazi, Farbod
    Behrad, Alireza
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2012, 25 (06) : 1194 - 1202
  • [5] A Phoneme Recognition Framework based on Auditory Spectro-Temporal Receptive Fields
    Thomas, Samuel
    Patil, Kailash
    Ganapathy, Sriram
    Mesgarani, Nima
    Hermansky, Hynek
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2458 - 2461
  • [6] Nonnegative features of spectro-temporal sounds for classification
    Cho, YC
    Choi, SJ
    [J]. PATTERN RECOGNITION LETTERS, 2005, 26 (09) : 1327 - 1336
  • [7] Spectro-temporal features for environmental sound classification
    Thwe, Khine Zar
    Thaw, Mie Mie
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2019, 20 (02) : 179 - 189
  • [8] SPECTRO-TEMPORAL SUBBAND WIENER FILTER FOR SPEECH ENHANCEMENT
    Hsu, Chung-Chien
    Lin, Tse-En
    Chen, Jian-Hueng
    Chi, Tai-Shih
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4001 - 4004
  • [9] Hilbert Envelope Based Spectro-Temporal Features for Phoneme Recognition in Telephone Speech
    Thomas, Samuel
    Ganapathy, Sriram
    Hermansky, Hynek
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1521 - +
  • [10] Novel Gammatone Filterbank Based Spectro-Temporal Features for Robust Phoneme Recognition
    Nagpal, Ankit
    Patil, Hemant A.
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 342 - 350