Local fuzzy PCA based GMM with dimension reduction on speaker identification

被引:19
|
作者
Lee, KY [1 ]
机构
[1] Soong Sil Univ, Sch Elect Engn, Seoul 156743, South Korea
关键词
PCA; GMM; fuzzy clustering; speaker identification; dimension reduction;
D O I
10.1016/j.patrec.2004.07.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To reduce the high dimensionality required for training of feature vectors in speaker identification, we propose an efficient GMM based on local PCA with fuzzy clustering. The proposed method firstly partitions the data space into several disjoint clusters by fuzzy clustering, and then performs PCA using the fuzzy covariance matrix on each cluster. Finally, the GMM for speaker is obtained from the transformed feature vectors with reduced dimension in each cluster. Compared to the conventional GMM with diagonal covariance matrix, the proposed method shows faster result with less storage maintaining same performance. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:1811 / 1817
页数:7
相关论文
共 50 条
  • [21] Robust PCA-GMM-SVM System for Speaker Verification Task
    Zergat, Kawthar Yasmine
    Amrouche, Abderrahmane
    Asbai, Nassim
    Debyeche, Mohamed
    8TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS 2012), 2012, : 214 - 217
  • [22] Gender Identification of a Speaker Using MFCC and GMM
    Yucesoy, Ergun
    Nabiyev, Vasif V.
    2013 8TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO), 2013, : 626 - 629
  • [23] A hybrid GMM/SVM approach to speaker identification
    Fine, S
    Navrátil, J
    Gopinath, RA
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 417 - 420
  • [24] Efficient speaker identification based on robust VQ-PCA
    Lee, Y
    Lee, J
    Lee, KY
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2003, PT 2, PROCEEDINGS, 2003, 2668 : 631 - 638
  • [25] Speaker Cluster based GMM Tokenization for Speaker Recognition
    Ma, Bin
    Zhu, Donglai
    Tong, Rong
    Li, Haizhou
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 505 - 508
  • [26] Dimension Compactness in Speaker Identification
    Kanrar, Soumen
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
  • [27] Closed-set speaker identification using VQ and GMM based models
    Barai, Bidhan
    Chakraborty, Tapas
    Das, Nibaran
    Basu, Subhadip
    Nasipuri, Mita
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 173 - 196
  • [28] Robust Speaker Identification Based On Hybrid Model of VQ and GMM-UBM
    Nguyen, Vu X.
    Nguyen, Vu P. H.
    Pham, Tuan V.
    2015 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2015, : 490 - 495
  • [29] Closed-set speaker identification using VQ and GMM based models
    Bidhan Barai
    Tapas Chakraborty
    Nibaran Das
    Subhadip Basu
    Mita Nasipuri
    International Journal of Speech Technology, 2022, 25 : 173 - 196
  • [30] A hybrid GMM-SVM speaker identification system
    Mashao, DJ
    2004 IEEE AFRICON: 7TH AFRICON CONFERENCE IN AFRICA, VOLS 1 AND 2: TECHNOLOGY INNOVATION, 2004, : 319 - 322