Comparative Analysis on Different Cepstral Features for Speaker Identification Recognition

被引:0
|
作者
Hanifa, R. M. [1 ]
Isa, K. [1 ]
Mohamad, S. [1 ]
机构
[1] Univ Tun Hussein Onn Malaysia, Fac Elect & Elect Engn, Batu Pahat, Johor, Malaysia
关键词
speaker recognition; cepstral coefficients; MFCC; GFCC; SVM;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Speaker recognition is an Artificial Intelligent (AI) technology that lets the machine to process, interpret and respond to human language. In this work, the recorded speech developed from a collection of audio speeches is used as a database. Mel-frequency cepstral coefficients (MFCC) and gammatone frequency cepstral coefficients (GFCC) are two different cepstral features used in this work. These extracted features are then used to train, validate and test the classifier. Support Vector Machine (SVM) is the classifier used in developing the speaker identification system. This classifier is trained to classify the input speech into one of the ethnicity classes: Malay, Chinese, Indian or Bumiputera. The results are based on the two different usages of cepstral features from the same speech utterances by speakers. Finally, the comparative analysis of the speaker identification system is made concerning features and classifier. The results revealed that a combination of GFCC and pitch as the feature vectors (Model 4) produced the highest accuracy rate of 86.1%.
引用
收藏
页码:487 / 492
页数:6
相关论文
共 50 条
  • [21] Speaker identification using Kalman cepstral coefficients
    Svenda, Z
    Radová, V
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 295 - 300
  • [22] Wavelet Packet Based Mel Frequency Cepstral Features for Text Independent Speaker Identification
    Srivastava, Smriti
    Bhardwaj, Saurabh
    Bhandari, Abhishek
    Gupta, Krit
    Bahl, Hitesh
    Gupta, J. R. P.
    [J]. INTELLIGENT INFORMATICS, 2013, 182 : 237 - 247
  • [23] The contribution of cepstral and stylistic features to SRI's 2005 NIST speaker recognition evaluation system
    Ferrer, Luciana
    Shriberg, Elizabeth
    Kajarekar, Sachin S.
    Stolcke, Andreas
    Sonmez, Kemal
    Venkataraman, Anand
    Bratt, Harry
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 101 - 104
  • [24] Application of Shifted Delta Cepstral Features in Speaker Verification
    Calvo, Jose R.
    Fernandez, Rafael
    Hernandez, Gabriel
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 29 - 32
  • [25] iVector Fusion of Prosodic and Cepstral Features for Speaker Verification
    Kockmann, Marcel
    Ferrer, Luciana
    Burget, Lukas
    Cernocky, Jan Honza
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 272 - 275
  • [26] Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification
    Sarkar, Achintya K.
    Cong-Thanh Do
    Le, Viet-Bac
    Barras, Claude
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (09) : 1040 - 1044
  • [27] Subspace Analysis of Spectral Features for Speaker Recognition
    Chen, Ling
    Man, Hong
    Jia, Huading
    Wang, Zhiyi
    Wang, Lei
    Li, Zili
    [J]. 2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 98 - 102
  • [28] Comparative study of several novel acoustic features for speaker recognition
    Pervouchine, Vladimir
    Leedham, Graham
    Zhong, Haishan
    Cho, David
    Li, Haizhou
    [J]. BIOSIGNALS 2008: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON BIO-INSPIRED SYSTEMS AND SIGNAL PROCESSING, VOL 1, 2008, : 220 - 223
  • [29] Speaker recognition method based on deep residual network and improved Power Normalized Cepstral Coefficients features
    He, Runhua
    Li, Pan
    Li, Xuemei
    Chen, Shuhang
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VIRTUAL REALITY, AND VISUALIZATION (AIVRV 2021), 2021, 12153
  • [30] Computer identification of musical instruments using pattern recognition with cepstral coefficients as features
    Brown, JC
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (03): : 1933 - 1941