Comparison of Feature Extraction for Accent Dependent Thai Speech Recognition System

被引:0
|
作者
Tantisatirapong, Suchada [1 ]
Prasoproek, Chalisa [1 ]
Phothisonothai, Montri [2 ]
机构
[1] Srinakharinwirot Univ, Dept Biomed Engn, Ongkharak 26120, Nakhon Nayok, Thailand
[2] King Mongkuts Inst Technol Ladkrabang, Int Coll, Bangkok 10520, Thailand
关键词
Accents classification; feature extraction; energy spectral density; power spectral density; mel-frequency cepstral coefficients; spectrogram; support vector machine; CLASSIFICATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper aims to compare the feature extraction methods for accent dependent Thai speech from three regions including central, southern and northeastern regions. We investigate four frequency analysis methods: i.e., Energy Spectral Density (ESD), Power Spectral Density (PSD), Mel-Frequency Cepstral Coefficients (MFCC) and Spectrogram (SPT). Radial basis function kernel based on support vector machine is used as a classifier with 5-fold cross validation. The isolated speech data sets are recorded from 30 male and 30 female participants speaking the 10 Thai digits from 0 to 9. The MFCC-based feature gives better accuracy than ESD, PSD and SPT respectively. For within the same region, the MFCC-based feature provides average accuracy of 94.9% and 99.1% for male and female voices respectively. For the three regions, the MFCC-based feature provides average accuracy of 89.34% and 93.81% for male and female voices, respectively.
引用
下载
收藏
页码:322 / 325
页数:4
相关论文
共 50 条
  • [31] Comparison between different feature extraction techniques for audio-visual speech recognition
    Alin G. Chiţu
    Leon J. M. Rothkrantz
    Pascal Wiggers
    Jacek C. Wojdel
    Journal on Multimodal User Interfaces, 2007, 1 : 7 - 20
  • [32] Combining speech enhancement and auditory feature extraction for robust speech recognition
    Kleinschmidt, M
    Tchorz, J
    Kollmeier, B
    SPEECH COMMUNICATION, 2001, 34 (1-2) : 75 - 91
  • [33] Feature Extraction and Modeling Techniques in Speech Recognition: A Review
    Khan, Usman
    Sarim, Muhammad
    Bin Ahmad, Maaz
    Shafiq, Farhan
    2019 4TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS ENGINEERING (ICISE 2019), 2019, : 63 - 67
  • [34] Acceleration of feature extraction for FPGA based speech recognition
    Arminas, Vytautas
    Tamulevicius, Gintautas
    Navakauskas, Dalius
    Ivanovas, Edgaras
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2010, 2010, 7745
  • [35] Speech recognition with emphasis on wavelet based feature extraction
    Farooq, O
    Datta, S
    IETE JOURNAL OF RESEARCH, 2002, 48 (01) : 3 - 13
  • [36] A Review of Feature Extraction and Classification Techniques in Speech Recognition
    Yadav S.
    Kumar A.
    Yaduvanshi A.
    Meena P.
    SN Computer Science, 4 (6)
  • [37] The application of the additive model in the feature extraction of speech recognition
    Xi, WB
    Fang, L
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 753 - 756
  • [38] MVDR based feature extraction for robust speech recognition
    Dharanipragada, S
    Rao, BD
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 309 - 312
  • [39] Design of Feature Extraction Circuit for Speech Recognition Applications
    Saambhavi, V. B.
    Rao, S. S. S. P.
    Rajalakshmi, P.
    TENCON 2012 - 2012 IEEE REGION 10 CONFERENCE: SUSTAINABLE DEVELOPMENT THROUGH HUMANITARIAN TECHNOLOGY, 2012,
  • [40] Tandem connectionist feature extraction for conversational speech recognition
    Zhu, QF
    Chen, B
    Morgan, N
    Stolcke, A
    MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2005, 3361 : 223 - 231