APPLICATIONS OF MFCC AND VECTOR QUANTIZATION IN SPEAKER RECOGNITION

被引：0

作者：

Gupta, Arnav ^{[1
]}

Gupta, Harshit ^{[2
]}

机构：

[1] Jaypee Inst Informat Technol, Dept Elect & Commun Engn, Noida, India

[2] Delft Univ Technol, Dept Elect Engn, NL-2600 AA Delft, Netherlands

来源：

2013 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND SIGNAL PROCESSING (ISSP) | 2013年

关键词：

feature vector; feature modeling; MFCC; VQ; feature extraction; cepstral coefficients;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In speaker recognition, most of the computation originates from the likelihood computations between feature vectors of the unknown speaker and the models in the database. In this paper, we concentrate on optimizing Mel Frequency Cepstral Coefficient (MFCC) for feature extraction and Vector Quantization (VQ) for feature modeling. We reduce the number of feature vectors by pre-quantizing the test sequence prior to matching, and number of speakers by ruling out unlikely speakers during recognition process. The two important parameters, Recognition rate and minimized Average Distance between the samples, depends on the codebook size and the number of cepstral coefficients. We find, that this approach yields significant performance when the changes are made in the number of mfcc's and the codebook size. Recognition rate is found to reach upto 89% and the distortion reduced upto 69%.

引用

页码：170 / 173

页数：4

共 50 条

[21] Speaker recognition by combining MFCC and phase information
Department of Information and Computer Sciences, Toyohashi University of Technology, Japan
[J]. Int. Speech Commun. Assoc. - Annu. Conf. Int. Speech Commun. Assoc., Interspeech, (1065-1068):
[22] Speaker Recognition Based on Dynamic MFCC Parameters
Wang Yutai
Li Bo
Jiang Xiaoqing
Liu Feng
Wang Lihao
[J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND SIGNAL PROCESSING, 2009, : 406 - 409
[23] Speaker Verification Using MFCC and Support Vector Machine
Chen, Shi-Huang
Luo, Yu-Ren
[J]. IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, : 532 - 535
[24] Non-Parametric Vector Quantization of Excitation Source Information for Speaker Recognition
Pati, Debadatta
Prasanna, S. R. Mahadeva
[J]. 2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4, 2008, : 1421 - 1424
[25] On Text-independent Speaker Recognition via Improved Vector Quantization Method
Liu Ting-ting
Guan Sheng-xiao
[J]. 2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 3912 - 3916
[26] A Comparison of MFCC and LPCC with Deep Learning for Speaker Recognition
Yang, Haiyan
Deng, Yanrong
Zhao, Hua-An
[J]. ICBDC 2019: PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON BIG DATA AND COMPUTING, 2019, : 160 - 164
[27] TEXT DEPENDENT SPEAKER RECOGNITION USING SHIFTED MFCC
Mukherjee, Rishiraj
Islam, Tanmoy
Sankar, Ravi
[J]. 2012 PROCEEDINGS OF IEEE SOUTHEASTCON, 2012,
[28] TEXT DEPENDENT SPEAKER RECOGNITION USING SHIFTED MFCC
Mukherjee, Rishiraj
Islam, Tanmoy
Sankar, Ravi
[J]. 2013 PROCEEDINGS OF IEEE SOUTHEASTCON, 2013,
[29] LDA combination of pitch and MFCC features in speaker recognition
Harrag, A
Mohamadi, T
Serignat, JF
[J]. INDICON 2005 Proceedings, 2005, : 237 - 240
[30] Speaker Recognition Based on MFCC and BP Neural Networks
Wang, Yi
Lawlor, Bob
[J]. 2017 28TH IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2017,

← 1 2 3 4 5 →