Performance Analysis of Speaker Identification using Gaussian Mixture Model and Support Vector Machine

被引:1
|
作者
Verma, Aman Ranjan [1 ]
Singh, S. Premananda [1 ]
Mishra, Ramesh Ch [1 ]
Katta, Kanchana [1 ]
机构
[1] IIIT Manipur, Dept ECE, Imphal, Manipur, India
关键词
Speaker Identification; GMM; SVM; MFCC; Voxforge;
D O I
10.1109/wiecon-ece48653.2019.9019970
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a comparative analysis of textindependent speaker identification techniques with Gaussian Mixture Model (GMM) and Support Vector Machine (SVM) separately. As feature vector, it uses Mel-frequency cepstral coefficients (MFCC), its delta derivatives (Delta MFCC) and double delta derivatives (Delta Delta MFCC). The aim is to test the accuracy of the proposed models for different sizes of MFCC feature vector on fine-tuning GMM and SVM models. The proposed experimental setup with a frame overlap of 75% and MFCC feature size of 20-MFCC + 20-Delta Delta MFCC + 20-Delta Delta MFCC coefficients perform better on SVM than GMM. On prepared data set, the utmost accuracy of the model built using SVM is 100% whereas that of GMM is 95.74%. The data set is prepared using voice samples of native speakers taken from Voxforge website and personal survey. The diversity in the speech corpus shows that the method performs equally well irrespective of language, gender, age, and region.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] A Combination of Gaussian Mixture Model and Support Vector Machine for Speaker Verification
    Quoc Nguyen Viet
    Bao Hung Tran
    Bang Nguyen Phuong
    Duc Lung Vu
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON MEDICAL MEASUREMENTS AND APPLICATIONS (MEMEA), 2017, : 432 - 436
  • [2] A Combination Approach of Gaussian Mixture Models and Support Vector Machines for Speaker Identification
    Djemili, Rafik
    Bourouba, Hocine
    Korba, Amara
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2009, 6 (05) : 490 - 497
  • [3] Gaussian Mixture Reduction Methods Using Support Vector Machine
    Kazama, Haruki
    Tsukiyama, Shuji
    [J]. PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 411 - 416
  • [4] Parameter settings for speaker identification using Gaussian mixture model
    Eskidere, Oe
    Ertas, F.
    [J]. 2007 IEEE 15TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1-3, 2007, : 495 - +
  • [5] Text Independent Speaker Identification Using Gaussian Mixture Model
    Ting, Chee-Ming
    Salleh, Sh-Hussain
    Tan, Tian-Swee
    Ariff, A. K.
    [J]. ICIAS 2007: INTERNATIONAL CONFERENCE ON INTELLIGENT & ADVANCED SYSTEMS, VOLS 1-3, PROCEEDINGS, 2007, : 194 - 198
  • [6] Vector Quantization Decision Function for Gaussian Mixture Model Based Speaker Identification
    Ahmad, Abdul Manan
    Yee, Loh Mun
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS 2008), 2008, : 363 - 366
  • [7] A bilateral fuzzy support vector machine hybridizing the Gaussian mixture model
    Mohammadi, M.
    Sarmad, M.
    [J]. IRANIAN JOURNAL OF FUZZY SYSTEMS, 2021, 18 (03): : 161 - 177
  • [8] SPEAKER IDENTIFICATION AND VERIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS
    REYNOLDS, DA
    [J]. SPEECH COMMUNICATION, 1995, 17 (1-2) : 91 - 108
  • [9] Speaker identification research based on gaussian mixture model
    Chunguang, Han
    Hua, Li
    Jia, Ding
    [J]. 2007 International Symposium on Computer Science & Technology, Proceedings, 2007, : 702 - 705
  • [10] Optimization of Gaussian mixture model parameters for speaker identification
    Hong, QY
    Kwong, S
    Wang, HL
    [J]. GENETIC AND EVOLUTIONARY COMPUTATION GECCO 2004 , PT 2, PROCEEDINGS, 2004, 3103 : 1310 - 1311