Novel discriminative vector quantization approach for speaker identification

被引：2

作者：

Zhou, GY

Mikhael, WB

Myers, B

机构：

[1] Univ Cent Florida, Dept Elect & Comp Engn, Orlando, FL 32826 USA

[2] Conexant Corp, Ishibashi, Tochigi 32905, Japan

来源：

JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS | 2005年 / 14卷 / 03期

关键词：

speaker identification; vector quantization; discriminative weight; feature space segmentation;

D O I：

10.1142/S0218126605002404

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

A novel Discriminative Vector Quantization method for Speaker Identification (DVQSI) is proposed, and its parameters selection is discussed. In the training mode of this approach, the vector space of speech features is divided into a number of regions. Then, a Vector Quantization (VQ) codebook for each speaker in each region is constructed. For every possible combination of speaker pairs, a discriminative weight is assigned for each region, based on the region's ability to discriminate between the speaker pair. Consequently, the region, which contains a larger distribution difference between the speech feature vector sets of the two speakers in the speaker pair, plays a more important role by assigning it a larger discriminative weight, in identifying the better speaker match from the two speakers. In the testing mode, to identify an unknown speaker, discriminative weighted average VQ distortion pairs are computed for the unknown speaker input waveform. Then, a technique is described that figures out the best match between the unknown waveform and speakers' templates. The proposed DVQSI approach can be considered a generalization of the existing VQ technique for Speaker Identification (VQSI). The method presented here yields better Speaker Identification (SI) accuracy by employing the discriminative weights and space segmentation as design parameters. This is confirmed experimentally. In addition, a computationally efficient implementation of the DVQSI technique is given which uses a tree-structured-like approach to obtain the codebooks.

引用

页码：581 / 596

页数：16

共 50 条

[1] Analysis of discriminative vector quantization approach for speaker identification
Zhou, GY
Mikhael, WB
[J]. 8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VI, PROCEEDINGS: IMAGE, ACOUSTIC, SIGNAL PROCESSING AND OPTICAL SYSTEMS, TECHNOLOGIES AND APPLICATIONS, 2004, : 479 - 483
[2] Speaker Identification based on Discriminative Vector Quantization
Zhou, GY
Mikhael, WB
[J]. Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 617 - 620
[3] Speaker identification based on vector quantization
Radová, V
Svenda, Z
[J]. TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 341 - 344
[4] A VECTOR QUANTIZATION APPROACH TO SPEAKER RECOGNITION
SOONG, FK
ROSENBERG, AE
JUANG, BH
RABINER, LR
[J]. AT&T TECHNICAL JOURNAL, 1987, 66 (02): : 14 - 26
[5] Speaker identification based on adaptive discriminative vector quantisation
Zhou, G.
Mikhael, W. B.
[J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2006, 153 (06): : 754 - 760
[6] A modified group vector quantization algorithm for speaker identification
Abu El-Yazeed, MF
Kader, NSA
El-Henawy, MM
[J]. PROCEEDINGS OF THE 46TH IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS & SYSTEMS, VOLS 1-3, 2003, : 629 - 632
[7] Novel approach in speaker identification using support vector machines
Rabbani, Navid
Sedaaghi, Mohammad Hossein
[J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 1123 - 1126
[8] An approach of binary isomorphic quantization for speaker identification
Junsod, S
Surarerks, A
[J]. PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 761 - 764
[9] A DISCRIMINATIVE APPROACH FOR SPEAKER SELECTION IN SPEAKER DE-IDENTIFICATION SYSTEMS
Abou-Zleikha, Mohamed
Tan, Zheng-Hua
Christensen, Mads Graesboll
Jensen, Soren Holdt
[J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2102 - 2106
[10] Speaker Identification with Vector Quantization and K-Harmonic Means
Yazici, Mustafa
Ulutas, Mustafa
[J]. 2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 2134 - 2137

← 1 2 3 4 5 →