APPLICATIONS OF MFCC AND VECTOR QUANTIZATION IN SPEAKER RECOGNITION

被引:0
|
作者
Gupta, Arnav [1 ]
Gupta, Harshit [2 ]
机构
[1] Jaypee Inst Informat Technol, Dept Elect & Commun Engn, Noida, India
[2] Delft Univ Technol, Dept Elect Engn, NL-2600 AA Delft, Netherlands
关键词
feature vector; feature modeling; MFCC; VQ; feature extraction; cepstral coefficients;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In speaker recognition, most of the computation originates from the likelihood computations between feature vectors of the unknown speaker and the models in the database. In this paper, we concentrate on optimizing Mel Frequency Cepstral Coefficient (MFCC) for feature extraction and Vector Quantization (VQ) for feature modeling. We reduce the number of feature vectors by pre-quantizing the test sequence prior to matching, and number of speakers by ruling out unlikely speakers during recognition process. The two important parameters, Recognition rate and minimized Average Distance between the samples, depends on the codebook size and the number of cepstral coefficients. We find, that this approach yields significant performance when the changes are made in the number of mfcc's and the codebook size. Recognition rate is found to reach upto 89% and the distortion reduced upto 69%.
引用
收藏
页码:170 / 173
页数:4
相关论文
共 50 条
  • [1] Speaker Recognition using MFCC, shifted MFCC with Vector Quantization and Fuzzy
    Bansal, Priyanka
    Imam, Syed Akhtar
    Bharti, Roma
    [J]. 2015 INTERNATIONAL CONFERENCE ON SOFT COMPUTING TECHNIQUES AND IMPLEMENTATIONS (ICSCTI), 2015,
  • [2] MFCC and vector quantization for Arabic fricatives Speech/Speaker recognition
    Chelali, Fatma Zohra
    Djeradi, Amar
    [J]. 2012 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2012, : 284 - 289
  • [3] Differential MFCC and Vector Quantization used for Real-Time Speaker Recognition System
    Chen, Wang
    Miao Zhenjiang
    Xiao, Meng
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 319 - 323
  • [4] Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition System
    Omer, Ala Eldin
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMMUNICATION, CONTROL, COMPUTING AND ELECTRONICS ENGINEERING (ICCCCEE), 2017,
  • [5] A VECTOR QUANTIZATION APPROACH TO SPEAKER RECOGNITION
    SOONG, FK
    ROSENBERG, AE
    JUANG, BH
    RABINER, LR
    [J]. AT&T TECHNICAL JOURNAL, 1987, 66 (02): : 14 - 26
  • [6] Using Information Theoretic Vector Quantization for Inverted MFCC based Speaker Verification
    Memon, Sheeraz
    Lech, Margaret
    He, Ling
    [J]. 2009 2ND INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND COMMUNICATION, 2009, : 181 - 185
  • [7] Spoken Word Recognition Using MFCC and Learning Vector Quantization
    Djamal, Esmeralda C.
    Nurhamidah, Neneng
    Ilyas, Ridwan
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTER SCIENCE AND INFORMATICS (EECSI), 2017, : 246 - 251
  • [8] Optimum vector quantization codebook design for speaker recognition
    Zhang, XY
    Wu, JP
    Zhang, YW
    Zhang, QS
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 1397 - 1402
  • [9] Speaker Dependent Speech Emotion Recognition using MFCC and Support Vector Machine
    Dahake, Prajakta P.
    Shaw, Kailash
    Malathi, P.
    [J]. 2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), 2016, : 1080 - 1084
  • [10] On the Use of MFCC Feature Vector Clustering for Efficient Text Dependent Speaker Recognition
    Samal, Ankit
    Parida, Deebyadeep
    Satapathy, Mihir Ranjan
    Mohanty, Mihir Narayan
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2013, 2014, 247 : 305 - 312