A Fast Speaker Indexing Using Vector Quantization and Second Order Statistics with Adaptive Threshold Computation

Cited by: 0
Author
Biatov, Konstantin [1]
Affiliation
[1] Biatov Lab, St Augustin, Germany
Keywords
speaker indexing; speaker clustering; Bayesian Information Criterion; vector quantization;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper describes an effective unsupervised speaker indexing approach. We propose a two-stage algorithm to speed up the state-of-the-art algorithm based on the Bayesian Information Criterion (BIC). In the first stage of the merging process, a computationally cheap method based on vector quantization (VQ) is used. In the second stage, a more computationally expensive technique based on the BIC is applied. The speaker indexing task relies on a tuning parameter, or threshold. We propose an on-line procedure that sets the value of this tuning parameter without using development data. The results are evaluated on the ESTER corpus.
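The two-stage merge test described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes the commonly used full-covariance Gaussian form of the BIC difference and a plain k-means codebook for the VQ stage, and it does not reproduce the adaptive threshold procedure, which the abstract does not specify. The names `delta_bic`, `vq_distortion`, `train_codebook`, `should_merge`, `vq_threshold`, and `lam` are hypothetical.

```python
import numpy as np


def logdet_cov(x):
    """Log-determinant of the full sample covariance of frames x, shape (n, d)."""
    cov = np.cov(x, rowvar=False, bias=True)
    _, logdet = np.linalg.slogdet(np.atleast_2d(cov))
    return logdet


def delta_bic(x1, x2, lam=1.0):
    """BIC difference between modelling x1 and x2 with two Gaussians or one.

    Negative values favour one Gaussian (merge); positive values favour
    two speakers. `lam` plays the role of the tuning parameter (assumed form).
    """
    n1, n2 = len(x1), len(x2)
    n, d = n1 + n2, x1.shape[1]
    both = np.vstack([x1, x2])
    gain = 0.5 * (n * logdet_cov(both)
                  - n1 * logdet_cov(x1)
                  - n2 * logdet_cov(x2))
    penalty = 0.5 * lam * (d + d * (d + 1) / 2) * np.log(n)
    return gain - penalty


def train_codebook(x, size=8, iters=10, seed=0):
    """Tiny k-means codebook (Lloyd iterations); illustrative only."""
    rng = np.random.default_rng(seed)
    codebook = x[rng.choice(len(x), size=size, replace=False)].copy()
    for _ in range(iters):
        d2 = ((x[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for k in range(size):
            members = x[labels == k]
            if len(members):
                codebook[k] = members.mean(axis=0)
    return codebook


def vq_distortion(codebook, x):
    """Mean squared distance from each frame in x to its nearest codeword."""
    d2 = ((x[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return d2.min(axis=1).mean()


def should_merge(x1, x2, vq_threshold, lam=1.0):
    """Two-stage test: cheap VQ screen first, BIC only for surviving pairs."""
    if vq_distortion(train_codebook(x1), x2) > vq_threshold:
        return False  # rejected by the cheap first stage, BIC is skipped
    return bool(delta_bic(x1, x2, lam) < 0)
```

In this sketch `vq_threshold` is a fixed value supplied by the caller; the on-line threshold estimation mentioned in the abstract would replace that constant.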
Pages: 1453-1456
Page count: 4