Local fuzzy PCA based GMM with dimension reduction on speaker identification

被引：19

作者：

Lee, KY ^{[1
]}

机构：

[1] Soong Sil Univ, Sch Elect Engn, Seoul 156743, South Korea

来源：

PATTERN RECOGNITION LETTERS | 2004年 / 25卷 / 16期

关键词：

PCA; GMM; fuzzy clustering; speaker identification; dimension reduction;

D O I：

10.1016/j.patrec.2004.07.006

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

To reduce the high dimensionality required for training of feature vectors in speaker identification, we propose an efficient GMM based on local PCA with fuzzy clustering. The proposed method firstly partitions the data space into several disjoint clusters by fuzzy clustering, and then performs PCA using the fuzzy covariance matrix on each cluster. Finally, the GMM for speaker is obtained from the transformed feature vectors with reduced dimension in each cluster. Compared to the conventional GMM with diagonal covariance matrix, the proposed method shows faster result with less storage maintaining same performance. (C) 2004 Elsevier B.V. All rights reserved.

引用

页码：1811 / 1817

页数：7

共 50 条

[41] Fractal dimension applied to speaker identification
Petry, A
Barone, DAC
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 405 - 408
[42] Manifold learning based speaker dependent dimension reduction for robust text independent speaker verification
Zabihzadeh D.
Moattar M.H.
Zabihzadeh, D. (d.zabihzadeh@gmail.com), 1600, Kluwer Academic Publishers (17): : 271 - 280
[43] A GMM-based handset selector for channel mismatch compensation with applications to speaker identification
Yiu, KK
Mak, MW
Kung, SY
ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 1132 - 1137
[44] Secondary classification for GMM based speaker recognition
Pelecanos, Jason
Povey, Dan
Ramaswamy, Ganesh
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 109 - 112
[45] Speaker recognition based on the combination of GMM and SVDD
Zhou, Yuhuan
Zhang, Xiongwei
Wang, Jinming
Gong, Yong
Zhou, Yi
PRZEGLAD ELEKTROTECHNICZNY, 2011, 87 (03): : 329 - 332
[46] On the use of PCA in GMM and AR-vector models for text independent speaker verification
de Lima, CB
Alcaim, A
Apolinario, JA
DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 595 - 598
[47] Speaker Recognition Based on GMM with an Embedded TDNN
Chen, Cunbao
Zhao, Li
NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2009, 5864 : 746 - 753
[48] An Abnormal Phone Identification Model with Meta learning Two-layer Framework Based on PCA Dimension Reduction
Yuan, Yahan
Ji, Ke
Sun, Runyuan
Ma, Kun
Chen, Zhenxiang
Wang, Lin
ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 511 - 515
[49] A Discriminative Performance Metric for GMM-UBM Speaker Identification
Dehzangi, Omid
Ma, Bin
Chng, Eng Siong
Li, Haizhou
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2114 - +
[50] Speaker Identification Using Discriminative Learning of Large Margin GMM
Daoudi, Khalid
Jourani, Reda
Andre-Obrecht, Regine
Aboutajdine, Driss
NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 300 - +

← 1 2 3 4 5 →