Comparison of Feature Extraction for Accent Dependent Thai Speech Recognition System

被引:0
|
作者
Tantisatirapong, Suchada [1 ]
Prasoproek, Chalisa [1 ]
Phothisonothai, Montri [2 ]
机构
[1] Srinakharinwirot Univ, Dept Biomed Engn, Ongkharak 26120, Nakhon Nayok, Thailand
[2] King Mongkuts Inst Technol Ladkrabang, Int Coll, Bangkok 10520, Thailand
关键词
Accents classification; feature extraction; energy spectral density; power spectral density; mel-frequency cepstral coefficients; spectrogram; support vector machine; CLASSIFICATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper aims to compare the feature extraction methods for accent dependent Thai speech from three regions including central, southern and northeastern regions. We investigate four frequency analysis methods: i.e., Energy Spectral Density (ESD), Power Spectral Density (PSD), Mel-Frequency Cepstral Coefficients (MFCC) and Spectrogram (SPT). Radial basis function kernel based on support vector machine is used as a classifier with 5-fold cross validation. The isolated speech data sets are recorded from 30 male and 30 female participants speaking the 10 Thai digits from 0 to 9. The MFCC-based feature gives better accuracy than ESD, PSD and SPT respectively. For within the same region, the MFCC-based feature provides average accuracy of 94.9% and 99.1% for male and female voices respectively. For the three regions, the MFCC-based feature provides average accuracy of 89.34% and 93.81% for male and female voices, respectively.
引用
下载
收藏
页码:322 / 325
页数:4
相关论文
共 50 条
  • [1] Phoneme and tonal accent recognition for Thai speech
    Theera-Umpon, Nipon
    Chansareewittaya, Suppakarn
    Auephanwiriyakul, Sansanee
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (10) : 13254 - 13259
  • [2] Feature Extraction Analysis on Indonesian Speech Recognition System
    Wisesty, Untari N.
    Adiwijaya
    Astuti, Widi
    2015 3rd International Conference on Information and Communication Technology (ICoICT), 2015, : 54 - 58
  • [3] The development of the feature extraction algorithms for Thai handwritten character recognition system
    Mitrpanont, JL
    Kiwprasopsak, S
    DEVELOPMENTS IN APPLIED ARTIFICAIL INTELLIGENCE, PROCEEDINGS, 2002, 2358 : 536 - 546
  • [4] Speech recognition as feature extraction for speaker recognition
    Stolcke, A.
    Shriberg, E.
    Ferrer, L.
    Kajarekar, S.
    Sonmez, K.
    Tur, G.
    2007 IEEE WORKSHOP ON SIGNAL PROCESSING APPLICATIONS FOR PUBLIC SECURITY AND FORENSICS, 2007, : 39 - +
  • [5] FEATURE EXTRACTION FOR A SPEECH RECOGNITION SYSTEM IN NOISY ENVIRONMENT: A STUDY\
    Shrawankar, Urmila
    Thakare, Vilas
    2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 1, 2010, : 358 - 361
  • [6] Multiresolution Feature Extraction (MRFE) based speech recognition system
    Priyanka, M. Anbu Swarna
    Solomi, V. Sherlin
    Vijayalakshmi, P.
    Nagarajan, T.
    2013 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION TECHNOLOGY (ICRTIT), 2013, : 152 - 156
  • [7] LPC AND LPCC METHOD OF FEATURE EXTRACTION IN SPEECH RECOGNITION SYSTEM
    Gupta, Harshita
    Gupta, Divya
    2016 6th International Conference - Cloud System and Big Data Engineering (Confluence), 2016, : 498 - 502
  • [8] Study of Robust Feature Extraction Techniques for Speech Recognition System
    Sharma, Usha
    Maheshkar, Sushila
    Mishra, A. N.
    2015 1ST INTERNATIONAL CONFERENCE ON FUTURISTIC TRENDS ON COMPUTATIONAL ANALYSIS AND KNOWLEDGE MANAGEMENT (ABLAZE), 2015, : 666 - 670
  • [9] Optimizing feature extraction for speech recognition
    Lee, CH
    Hyun, DH
    Choi, ES
    Go, JW
    Lee, CY
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (01): : 80 - 87
  • [10] Feature extraction for robust speech recognition
    Dharanipragada, S
    2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, PROCEEDINGS, 2002, : 855 - 858