Novel discriminative vector quantization approach for speaker identification

被引:2
|
作者
Zhou, GY
Mikhael, WB
Myers, B
机构
[1] Univ Cent Florida, Dept Elect & Comp Engn, Orlando, FL 32826 USA
[2] Conexant Corp, Ishibashi, Tochigi 32905, Japan
关键词
speaker identification; vector quantization; discriminative weight; feature space segmentation;
D O I
10.1142/S0218126605002404
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A novel Discriminative Vector Quantization method for Speaker Identification (DVQSI) is proposed, and its parameters selection is discussed. In the training mode of this approach, the vector space of speech features is divided into a number of regions. Then, a Vector Quantization (VQ) codebook for each speaker in each region is constructed. For every possible combination of speaker pairs, a discriminative weight is assigned for each region, based on the region's ability to discriminate between the speaker pair. Consequently, the region, which contains a larger distribution difference between the speech feature vector sets of the two speakers in the speaker pair, plays a more important role by assigning it a larger discriminative weight, in identifying the better speaker match from the two speakers. In the testing mode, to identify an unknown speaker, discriminative weighted average VQ distortion pairs are computed for the unknown speaker input waveform. Then, a technique is described that figures out the best match between the unknown waveform and speakers' templates. The proposed DVQSI approach can be considered a generalization of the existing VQ technique for Speaker Identification (VQSI). The method presented here yields better Speaker Identification (SI) accuracy by employing the discriminative weights and space segmentation as design parameters. This is confirmed experimentally. In addition, a computationally efficient implementation of the DVQSI technique is given which uses a tree-structured-like approach to obtain the codebooks.
引用
收藏
页码:581 / 596
页数:16
相关论文
共 50 条
  • [1] Analysis of discriminative vector quantization approach for speaker identification
    Zhou, GY
    Mikhael, WB
    [J]. 8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VI, PROCEEDINGS: IMAGE, ACOUSTIC, SIGNAL PROCESSING AND OPTICAL SYSTEMS, TECHNOLOGIES AND APPLICATIONS, 2004, : 479 - 483
  • [2] Speaker Identification based on Discriminative Vector Quantization
    Zhou, GY
    Mikhael, WB
    [J]. Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 617 - 620
  • [3] Speaker identification based on vector quantization
    Radová, V
    Svenda, Z
    [J]. TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 341 - 344
  • [4] A VECTOR QUANTIZATION APPROACH TO SPEAKER RECOGNITION
    SOONG, FK
    ROSENBERG, AE
    JUANG, BH
    RABINER, LR
    [J]. AT&T TECHNICAL JOURNAL, 1987, 66 (02): : 14 - 26
  • [5] Speaker identification based on adaptive discriminative vector quantisation
    Zhou, G.
    Mikhael, W. B.
    [J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2006, 153 (06): : 754 - 760
  • [6] A modified group vector quantization algorithm for speaker identification
    Abu El-Yazeed, MF
    Kader, NSA
    El-Henawy, MM
    [J]. PROCEEDINGS OF THE 46TH IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS & SYSTEMS, VOLS 1-3, 2003, : 629 - 632
  • [7] Novel approach in speaker identification using support vector machines
    Rabbani, Navid
    Sedaaghi, Mohammad Hossein
    [J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 1123 - 1126
  • [8] An approach of binary isomorphic quantization for speaker identification
    Junsod, S
    Surarerks, A
    [J]. PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 761 - 764
  • [9] A DISCRIMINATIVE APPROACH FOR SPEAKER SELECTION IN SPEAKER DE-IDENTIFICATION SYSTEMS
    Abou-Zleikha, Mohamed
    Tan, Zheng-Hua
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2102 - 2106
  • [10] Speaker Identification with Vector Quantization and K-Harmonic Means
    Yazici, Mustafa
    Ulutas, Mustafa
    [J]. 2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 2134 - 2137