A Fast Speaker Indexing Using Vector Quantization and Second Order Statistics with Adaptive Threshold Computation

Cited by: 0

Author:

Biatov, Konstantin [1]

Affiliation:

[1] Biatov Lab, St Augustin, Germany

Source:

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010

Keywords:

speaker indexing; speaker clustering; Bayesian Information Criterion; vector quantization

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper describes an effective unsupervised speaker indexing approach. We propose a two-stage algorithm to speed up the state-of-the-art algorithm based on the Bayesian Information Criterion (BIC). In the first stage of the merging process, a computationally cheap method based on vector quantization (VQ) is used. In the second stage, a more computationally expensive technique based on the BIC is applied. The speaker indexing task relies on a tuning parameter, or threshold. We propose an online procedure that sets the value of this tuning parameter without using development data. The results are evaluated on the ESTER corpus.


Pages: 1453-1456

Page count: 4
