APPLICATIONS OF MFCC AND VECTOR QUANTIZATION IN SPEAKER RECOGNITION

被引:0
|
作者
Gupta, Arnav [1 ]
Gupta, Harshit [2 ]
机构
[1] Jaypee Inst Informat Technol, Dept Elect & Commun Engn, Noida, India
[2] Delft Univ Technol, Dept Elect Engn, NL-2600 AA Delft, Netherlands
关键词
feature vector; feature modeling; MFCC; VQ; feature extraction; cepstral coefficients;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In speaker recognition, most of the computation originates from the likelihood computations between feature vectors of the unknown speaker and the models in the database. In this paper, we concentrate on optimizing Mel Frequency Cepstral Coefficient (MFCC) for feature extraction and Vector Quantization (VQ) for feature modeling. We reduce the number of feature vectors by pre-quantizing the test sequence prior to matching, and number of speakers by ruling out unlikely speakers during recognition process. The two important parameters, Recognition rate and minimized Average Distance between the samples, depends on the codebook size and the number of cepstral coefficients. We find, that this approach yields significant performance when the changes are made in the number of mfcc's and the codebook size. Recognition rate is found to reach upto 89% and the distortion reduced upto 69%.
引用
收藏
页码:170 / 173
页数:4
相关论文
共 50 条
  • [21] Speaker recognition by combining MFCC and phase information
    Department of Information and Computer Sciences, Toyohashi University of Technology, Japan
    [J]. Int. Speech Commun. Assoc. - Annu. Conf. Int. Speech Commun. Assoc., Interspeech, (1065-1068):
  • [22] Speaker Recognition Based on Dynamic MFCC Parameters
    Wang Yutai
    Li Bo
    Jiang Xiaoqing
    Liu Feng
    Wang Lihao
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND SIGNAL PROCESSING, 2009, : 406 - 409
  • [23] Speaker Verification Using MFCC and Support Vector Machine
    Chen, Shi-Huang
    Luo, Yu-Ren
    [J]. IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, : 532 - 535
  • [24] Non-Parametric Vector Quantization of Excitation Source Information for Speaker Recognition
    Pati, Debadatta
    Prasanna, S. R. Mahadeva
    [J]. 2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4, 2008, : 1421 - 1424
  • [25] On Text-independent Speaker Recognition via Improved Vector Quantization Method
    Liu Ting-ting
    Guan Sheng-xiao
    [J]. 2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 3912 - 3916
  • [26] A Comparison of MFCC and LPCC with Deep Learning for Speaker Recognition
    Yang, Haiyan
    Deng, Yanrong
    Zhao, Hua-An
    [J]. ICBDC 2019: PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON BIG DATA AND COMPUTING, 2019, : 160 - 164
  • [27] TEXT DEPENDENT SPEAKER RECOGNITION USING SHIFTED MFCC
    Mukherjee, Rishiraj
    Islam, Tanmoy
    Sankar, Ravi
    [J]. 2012 PROCEEDINGS OF IEEE SOUTHEASTCON, 2012,
  • [28] TEXT DEPENDENT SPEAKER RECOGNITION USING SHIFTED MFCC
    Mukherjee, Rishiraj
    Islam, Tanmoy
    Sankar, Ravi
    [J]. 2013 PROCEEDINGS OF IEEE SOUTHEASTCON, 2013,
  • [29] LDA combination of pitch and MFCC features in speaker recognition
    Harrag, A
    Mohamadi, T
    Serignat, JF
    [J]. INDICON 2005 Proceedings, 2005, : 237 - 240
  • [30] Speaker Recognition Based on MFCC and BP Neural Networks
    Wang, Yi
    Lawlor, Bob
    [J]. 2017 28TH IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2017,