Non-Parametric Vector Quantization of Excitation Source Information for Speaker Recognition

被引:0
|
作者
Pati, Debadatta [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol, Dept Elect & Commun Engn, Gauhati 781039, India
关键词
speaker information; excitation source; vocal tract; VQ;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of this work is to demonstrate the feasibility of excitation source information obtained by non-parametric Vector Quantization (VQ) for speaker recognition task. Linear Prediction (LP) residual is used as the representation of excitation source information. The LP residual is subjected to non-parametric VQ during training. The codebooks; are built for different codebook sizes. The testing of these codebooks using the LP residual of testing speech data indeed demonstrates that a codebook of sufficiently large size uniquely represents the speaker and provides appreciable performance. The speaker recognition system built using conventional Mel Frequency Cepstral Coefficients (MFCCs) representing vocal tract information combines well with the proposed speaker recognition system using excitation source information to provide improved performance. On a set of randomly chosen 30 speakers from the TIMIT database, the proposed system provides 75%, MFCC based system provides 95% and the combined one provides 98.33%.
引用
收藏
页码:1421 / 1424
页数:4
相关论文
共 50 条
  • [1] A VECTOR QUANTIZATION APPROACH TO SPEAKER RECOGNITION
    SOONG, FK
    ROSENBERG, AE
    JUANG, BH
    RABINER, LR
    [J]. AT&T TECHNICAL JOURNAL, 1987, 66 (02): : 14 - 26
  • [2] Vector quantization of a parametric source
    Wolfe, L
    [J]. 1997 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING, VOLS 1 AND 2: PACRIM 10 YEARS - 1987-1997, 1997, : 706 - 710
  • [3] APPLICATIONS OF MFCC AND VECTOR QUANTIZATION IN SPEAKER RECOGNITION
    Gupta, Arnav
    Gupta, Harshit
    [J]. 2013 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND SIGNAL PROCESSING (ISSP), 2013, : 170 - 173
  • [4] Optimum vector quantization codebook design for speaker recognition
    Zhang, XY
    Wu, JP
    Zhang, YW
    Zhang, QS
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 1397 - 1402
  • [5] Speaker Verification Based on Information Theoretic Vector Quantization
    Memon, Sheeraz
    Lech, Margaret
    [J]. WIRELESS NETWORKS, INFORMATION PROCESSING AND SYSTEMS, 2008, 20 : 391 - 399
  • [6] KNNDIST: A Non-Parametric Distance Measure for Speaker Segmentation
    Mohammadi, Seyed Hamidreza
    Sameti, Hossein
    Langarani, Mahsa Sadat Elyasi
    Tavanaei, Amirhossein
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2279 - 2282
  • [7] A NON-PARAMETRIC ANALYSIS OF RECOGNITION EXPERIMENTS
    POLLACK, I
    NORMAN, DA
    [J]. PSYCHONOMIC SCIENCE, 1964, 1 (05): : 125 - 126
  • [8] AN ALGORITHM FOR NON-PARAMETRIC PATTERN RECOGNITION
    SEBESTYEN, G
    EDIE, J
    [J]. IEEE TRANSACTIONS ON ELECTRONIC COMPUTERS, 1966, EC15 (06): : 908 - +
  • [9] Speaker Recognition using Excitation Source Parameters
    Kamarauskas, J.
    Salna, B.
    [J]. ELEKTRONIKA IR ELEKTROTECHNIKA, 2011, (01) : 55 - 58
  • [10] Speaker Recognition from Excitation Source Perspective
    Pati, Debadatta
    Prasanna, S. R. Mahadeva
    [J]. IETE TECHNICAL REVIEW, 2010, 27 (02) : 138 - 157