TEXT INDEPENDENT SPEAKER VERIFICATION USING ENHANCED SORTED GAUSSIAN MIXTURE MODEL

Cited by: 0
Authors
Saeidi, R. [1]
Ganchev, T. [2]
Mohammadi, H. R. Sadegh [1]
Affiliations
[1] Iranian Res Inst Elect Engn, Tehran, Iran
[2] Univ Patras, Wire Commun Lab, GR-26500 Patras, Greece
Keywords
Speaker Verification; GMM-UBM; GMM speed-up; sorting functions; sorted GMM;
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
In this work we study an enhanced sorting function for the recently developed sorted GMM, a computationally efficient method for implementing the Gaussian mixture model universal background model (GMM-UBM) scheme. The sorted GMM employs partial search and thus has lower computational complexity and relaxed memory requirements compared to the well-known tree-structured GMM of the same model order. Experimental evaluation of the sorted GMM and its enhanced version was performed on two databases: (1) clean speech in Farsi recorded from TV broadcasts, and (2) telephone-quality speech in English (NIST 2002 SRE one-speaker detection data). The enhanced sorting scheme outperformed the original one, primarily in cases where very high acceleration rates were targeted, in scenarios where training and testing conditions matched. However, in mismatched train-test conditions the original sorted GMM performed better. Finally, the sorted GMM proved 14 times faster than the baseline system at the cost of only a 0.43 increase in equal error rate.
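The partial search mentioned in the abstract can be illustrated with a minimal sketch. It rests on assumptions not stated in this record: the sorting function is taken to be the plain sum of a vector's elements (not the enhanced function studied in the paper), covariances are diagonal, and all function names are hypothetical. Components are ordered by the sorting key of their means, and each frame is scored against only a small window of neighbouring components.

```python
import bisect
import math


def sort_key(vec):
    # Assumed sorting function: plain sum of the vector's elements.
    return sum(vec)


def log_gauss(x, mean, var):
    # Log-density of a diagonal-covariance Gaussian.
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def sort_components(weights, means, variances):
    # Order the mixture components by the sorting key of their mean vectors.
    order = sorted(range(len(means)), key=lambda i: sort_key(means[i]))
    keys = [sort_key(means[i]) for i in order]
    comps = [(weights[i], means[i], variances[i]) for i in order]
    return keys, comps


def partial_log_likelihood(x, keys, comps, width):
    # Locate the frame's key among the sorted component keys, then
    # evaluate only `width` neighbouring components around that position.
    centre = bisect.bisect_left(keys, sort_key(x))
    lo = max(0, min(centre - width // 2, len(comps) - width))
    terms = [
        math.log(w) + log_gauss(x, m, v) for w, m, v in comps[lo:lo + width]
    ]
    # Log-sum-exp over the evaluated components only.
    peak = max(terms)
    return peak + math.log(sum(math.exp(t - peak) for t in terms))
```

Setting `width` to the number of components reproduces the full mixture likelihood, while smaller values trade accuracy for speed, which is the kind of trade-off the abstract reports.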
Pages: 1191 / +
Page count: 3
Related papers
50 in total
  • [31] Forensic Speaker Verification Using Formant Features and Gaussian Mixture Models
    Becker, Timo
    Jessen, Michael
    Grigoras, Catalin
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1505 - +
  • [32] Extracting additional information from gaussian mixture model probabilities for improved text-independent speaker identification
    Narayanaswamy, B
    Gangadharaiah, R
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 621 - 624
  • [33] Text independent speaker identification with finite multivariate generalised Gaussian mixture model and k-means algorithm
    Sailaja, V.
    Rao, K. Srinivasa
    Reddy, K. V. V. S.
    INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2013, 6 (02) : 119 - 126
  • [34] Text-independent speaker verification using speaker clustering and support vector machines
    Hou, FL
    Wang, BX
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 456 - 459
  • [35] Pseudo speaker models for text-independent speaker verification using rank threshold
    Chiba University, Chiba, Japan
    NLP-KE - Proc. Int. Conf. Nat. Lang. Process. Knowl. Eng., (265-268):
  • [36] A STOCHASTIC FIXED LENGTH SEGMENT MODEL FOR TEXT INDEPENDENT SPEAKER VERIFICATION
    LIU, CS
    WANG, HC
    SIGNAL PROCESSING, 1995, 45 (02) : 183 - 191
  • [37] A tutorial on text-independent speaker verification
    Bimbot, F. (bimbot@irisa.fr), 1600, Hindawi Publishing Corporation (2004):
  • [38] A tutorial on text-independent speaker verification
    Bimbot, F
    Bonastre, JF
    Fredouille, C
    Gravier, G
    Magrin-Chagnolleau, I
    Meignier, S
    Merlin, T
    Ortega-García, J
    Petrovska-Delacrétaz, D
    Reynolds, DA
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) : 430 - 451
  • [39] A Tutorial on Text-Independent Speaker Verification
    Frédéric Bimbot
    Jean-François Bonastre
    Corinne Fredouille
    Guillaume Gravier
    Ivan Magrin-Chagnolleau
    Sylvain Meignier
    Teva Merlin
    Javier Ortega-García
    Dijana Petrovska-Delacrétaz
    Douglas A. Reynolds
    EURASIP Journal on Advances in Signal Processing, 2004
  • [40] Text-independent speaker verification using predictive neural networks
    Finan, RA
    Sapeluk, AT
    Damper, RI
    FIFTH INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS, 1997, (440): : 274 - 279