Comparative Analysis on Different Cepstral Features for Speaker Identification Recognition

被引:0
|
作者
Hanifa, R. M. [1 ]
Isa, K. [1 ]
Mohamad, S. [1 ]
机构
[1] Univ Tun Hussein Onn Malaysia, Fac Elect & Elect Engn, Batu Pahat, Johor, Malaysia
关键词
speaker recognition; cepstral coefficients; MFCC; GFCC; SVM;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Speaker recognition is an Artificial Intelligent (AI) technology that lets the machine to process, interpret and respond to human language. In this work, the recorded speech developed from a collection of audio speeches is used as a database. Mel-frequency cepstral coefficients (MFCC) and gammatone frequency cepstral coefficients (GFCC) are two different cepstral features used in this work. These extracted features are then used to train, validate and test the classifier. Support Vector Machine (SVM) is the classifier used in developing the speaker identification system. This classifier is trained to classify the input speech into one of the ethnicity classes: Malay, Chinese, Indian or Bumiputera. The results are based on the two different usages of cepstral features from the same speech utterances by speakers. Finally, the comparative analysis of the speaker identification system is made concerning features and classifier. The results revealed that a combination of GFCC and pitch as the feature vectors (Model 4) produced the highest accuracy rate of 86.1%.
引用
收藏
页码:487 / 492
页数:6
相关论文
共 50 条
  • [1] Cepstral Features and Text-Dependent Speaker Identification A Comparative Study
    Ouzounov, Atanas
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2010, 10 (01) : 3 - 12
  • [2] Reducing the environmental sensitivity of cepstral features for speaker recognition
    Openshaw, JP
    Mason, JS
    [J]. ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 721 - 724
  • [3] Filter bank Based Cepstral Features for Speaker Recognition
    Chougule, Sharada V.
    Chavan, Mahesh S.
    Gaikwad, M. S.
    [J]. 2014 IEEE GLOBAL CONFERENCE ON WIRELESS COMPUTING AND NETWORKING (GCWCN), 2014, : 102 - 106
  • [4] Speaker identification using cepstral analysis
    Nazar, MN
    [J]. ISCON 2002: IEEE STUDENTS CONFERENCE ON EMERGING TECHNOLOGIES, PROCEEDINGS, 2002, : 139 - 143
  • [5] Speaker Identification using Warped MVDR Cepstral Features
    Woelfel, Matthias
    Yang, Qian
    Jin, Qin
    Schultz, Tanja
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 904 - +
  • [6] Wavelet packet cepstral analysis for speaker recognition
    Kinney, A
    Stevens, J
    [J]. THIRTY-SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS - CONFERENCE RECORD, VOLS 1 AND 2, CONFERENCE RECORD, 2002, : 206 - 209
  • [7] Variant Time-Frequency Cepstral Features for Speaker Recognition
    Zhang, Wei-Qiang
    Deng, Yan
    He, Liang
    Liu, Jia
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2122 - 2125
  • [8] LANGUAGE-INDEPENDENT CONSTRAINED CEPSTRAL FEATURES FOR SPEAKER RECOGNITION
    Shriberg, Elizabeth
    Stolcke, Andreas
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5296 - 5299
  • [9] A method of Automatic Speaker Recognition using cepstral features and vectorial quantization
    de Lara, JRC
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2005, 3773 : 146 - 153
  • [10] Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition
    Jokic, Ivan D.
    Jokic, Stevan D.
    Delic, Vlado D.
    Peric, Zoran H.
    [J]. 2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 419 - 424