Non-Parametric Vector Quantization of Excitation Source Information for Speaker Recognition

Cited by: 0

Authors:

Pati, Debadatta [1]

Prasanna, S. R. Mahadeva [1]

Affiliations:

[1] Indian Inst Technol, Dept Elect & Commun Engn, Gauhati 781039, India

Source:

2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4 | 2008

Keywords:

speaker information; excitation source; vocal tract; VQ;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

The objective of this work is to demonstrate the feasibility of using excitation source information, obtained by non-parametric Vector Quantization (VQ), for the speaker recognition task. The Linear Prediction (LP) residual is used as the representation of excitation source information. The LP residual is subjected to non-parametric VQ during training, and codebooks are built for different codebook sizes. Testing these codebooks with the LP residual of the test speech data demonstrates that a codebook of sufficiently large size uniquely represents the speaker and provides appreciable performance. A speaker recognition system built using conventional Mel Frequency Cepstral Coefficients (MFCCs), which represent vocal tract information, combines well with the proposed system based on excitation source information to provide improved performance. On a set of 30 randomly chosen speakers from the TIMIT database, the proposed system provides 75%, the MFCC-based system provides 95%, and the combined system provides 98.33%.
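The pipeline outlined in the abstract (LP residual, then a VQ codebook per speaker, then scoring of test data against each codebook) can be sketched roughly as below. This is only an illustrative sketch, not the authors' implementation: the record does not state the LP order, frame length, vector dimension, or codebook training procedure, so `order=10`, `frame_len=160`, `dim=16`, the plain Lloyd (k-means) training loop, and all function names are assumptions.

```python
import numpy as np

def lp_residual(frame, order=10):
    """LP residual of one frame (autocorrelation method; order is an assumption)."""
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation at lags 0..order
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    # Toeplitz normal equations R a = r[1:], lightly regularized for stability
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += (1e-9 * r[0] + 1e-12) * np.eye(order)
    a = np.linalg.solve(R, r[1:])
    # Prediction s_hat[m] = sum_k a[k] * s[m - k - 1]; residual = s - s_hat
    pred = np.zeros(n)
    for k in range(order):
        pred[k + 1:] += a[k] * frame[:n - k - 1]
    return frame - pred

def residual_vectors(signal, frame_len=160, order=10, dim=16):
    """Cut the per-frame LP residual into dim-sample blocks (assumed blocking)."""
    signal = np.asarray(signal, dtype=float)
    blocks = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        res = lp_residual(signal[start:start + frame_len], order)
        blocks.append(res.reshape(-1, dim))  # frame_len must be a multiple of dim
    return np.vstack(blocks)

def pairwise_sq_dist(vectors, codebook):
    """Squared Euclidean distance from every vector to every codeword."""
    return ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)

def train_codebook(vectors, size, iters=20, seed=0):
    """Plain Lloyd iterations; stands in for whatever training the paper used."""
    rng = np.random.default_rng(seed)
    codebook = vectors[rng.choice(len(vectors), size, replace=False)].copy()
    for _ in range(iters):
        nearest = pairwise_sq_dist(vectors, codebook).argmin(axis=1)
        for j in range(size):
            members = vectors[nearest == j]
            if len(members):  # keep the old codeword if a cell is empty
                codebook[j] = members.mean(axis=0)
    return codebook

def avg_distortion(vectors, codebook):
    """Mean distance of each vector to its nearest codeword."""
    return pairwise_sq_dist(vectors, codebook).min(axis=1).mean()

def identify(test_vectors, codebooks):
    """Return the key of the codebook with the lowest average distortion."""
    return min(codebooks, key=lambda spk: avg_distortion(test_vectors, codebooks[spk]))
```

In use, one codebook would be trained per enrolled speaker from `residual_vectors(training_signal)`, and `identify` would be called with the residual vectors of a test utterance and a dictionary mapping speaker labels to codebooks. How the excitation-based scores are combined with the MFCC-based system is not described in this record, so that step is left out.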


Pages: 1421-1424

Number of pages: 4
