Local fuzzy PCA based GMM with dimension reduction on speaker identification

被引:19
|
作者
Lee, KY [1 ]
机构
[1] Soong Sil Univ, Sch Elect Engn, Seoul 156743, South Korea
关键词
PCA; GMM; fuzzy clustering; speaker identification; dimension reduction;
D O I
10.1016/j.patrec.2004.07.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To reduce the high dimensionality required for training of feature vectors in speaker identification, we propose an efficient GMM based on local PCA with fuzzy clustering. The proposed method firstly partitions the data space into several disjoint clusters by fuzzy clustering, and then performs PCA using the fuzzy covariance matrix on each cluster. Finally, the GMM for speaker is obtained from the transformed feature vectors with reduced dimension in each cluster. Compared to the conventional GMM with diagonal covariance matrix, the proposed method shows faster result with less storage maintaining same performance. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:1811 / 1817
页数:7
相关论文
共 50 条
  • [41] Fractal dimension applied to speaker identification
    Petry, A
    Barone, DAC
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 405 - 408
  • [42] Manifold learning based speaker dependent dimension reduction for robust text independent speaker verification
    Zabihzadeh D.
    Moattar M.H.
    Zabihzadeh, D. (d.zabihzadeh@gmail.com), 1600, Kluwer Academic Publishers (17): : 271 - 280
  • [43] A GMM-based handset selector for channel mismatch compensation with applications to speaker identification
    Yiu, KK
    Mak, MW
    Kung, SY
    ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 1132 - 1137
  • [44] Secondary classification for GMM based speaker recognition
    Pelecanos, Jason
    Povey, Dan
    Ramaswamy, Ganesh
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 109 - 112
  • [45] Speaker recognition based on the combination of GMM and SVDD
    Zhou, Yuhuan
    Zhang, Xiongwei
    Wang, Jinming
    Gong, Yong
    Zhou, Yi
    PRZEGLAD ELEKTROTECHNICZNY, 2011, 87 (03): : 329 - 332
  • [46] On the use of PCA in GMM and AR-vector models for text independent speaker verification
    de Lima, CB
    Alcaim, A
    Apolinario, JA
    DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 595 - 598
  • [47] Speaker Recognition Based on GMM with an Embedded TDNN
    Chen, Cunbao
    Zhao, Li
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2009, 5864 : 746 - 753
  • [48] An Abnormal Phone Identification Model with Meta learning Two-layer Framework Based on PCA Dimension Reduction
    Yuan, Yahan
    Ji, Ke
    Sun, Runyuan
    Ma, Kun
    Chen, Zhenxiang
    Wang, Lin
    ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 511 - 515
  • [49] A Discriminative Performance Metric for GMM-UBM Speaker Identification
    Dehzangi, Omid
    Ma, Bin
    Chng, Eng Siong
    Li, Haizhou
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2114 - +
  • [50] Speaker Identification Using Discriminative Learning of Large Margin GMM
    Daoudi, Khalid
    Jourani, Reda
    Andre-Obrecht, Regine
    Aboutajdine, Driss
    NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 300 - +