Local fuzzy PCA based GMM with dimension reduction on speaker identification

Cited by: 19

Authors:

Lee, KY [1]

Affiliation:

[1] Soong Sil Univ, Sch Elect Engn, Seoul 156743, South Korea

Source:

PATTERN RECOGNITION LETTERS | 2004 / Vol. 25 / Issue 16

Keywords:

PCA; GMM; fuzzy clustering; speaker identification; dimension reduction;

DOI:

10.1016/j.patrec.2004.07.006

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

To reduce the high dimensionality of the feature vectors used for training in speaker identification, we propose an efficient GMM based on local PCA with fuzzy clustering. The proposed method first partitions the data space into several disjoint clusters by fuzzy clustering, and then performs PCA on each cluster using the fuzzy covariance matrix. Finally, the GMM for each speaker is obtained from the transformed feature vectors of reduced dimension in each cluster. Compared to the conventional GMM with a diagonal covariance matrix, the proposed method runs faster and requires less storage while maintaining the same performance. (C) 2004 Elsevier B.V. All rights reserved.
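The first two steps described in the abstract (fuzzy clustering, then PCA on a fuzzy covariance matrix per cluster) can be illustrated with the rough NumPy sketch below. It is not the paper's implementation: the function names, the fuzzifier `m = 2`, the membership weighting `u^m` in the covariance, and the hard assignment by largest membership are all assumptions made for illustration, and the final per-speaker GMM fit is left out.

```python
import numpy as np


def fuzzy_c_means(X, n_clusters, m=2.0, n_iter=50, seed=0):
    """Standard fuzzy c-means. Returns centers (C, D) and memberships U (N, C)."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)            # each row sums to 1
    for _ in range(n_iter):
        W = U ** m                                # fuzzified weights
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        # squared Euclidean distance from every point to every center
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        d2 = np.maximum(d2, 1e-12)                # avoid division by zero
        inv = d2 ** (-1.0 / (m - 1.0))
        U = inv / inv.sum(axis=1, keepdims=True)
    return centers, U


def local_fuzzy_pca(X, centers, U, n_components, m=2.0):
    """For each cluster, eigen-decompose a membership-weighted covariance
    (assumed form) and keep the leading eigenvectors as a projection basis."""
    W = U ** m
    bases = []
    for j, center in enumerate(centers):
        diff = X - center
        cov = (W[:, j, None] * diff).T @ diff / W[:, j].sum()
        _, eigvecs = np.linalg.eigh(cov)          # eigenvalues in ascending order
        bases.append(eigvecs[:, ::-1][:, :n_components])
    return bases


def reduce_dimension(X, centers, U, bases):
    """Assign each vector to its highest-membership cluster (assumption)
    and project it onto that cluster's basis."""
    labels = U.argmax(axis=1)
    reduced = [(X[labels == j] - centers[j]) @ bases[j]
               for j in range(len(centers))]
    return labels, reduced
```

Under these assumptions, each array in `reduced` holds the lower-dimensional vectors of one cluster, which is the kind of input a diagonal-covariance GMM could then be trained on.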

Pages: 1811 - 1817

Number of pages: 7

50 items in total

[21] Robust PCA-GMM-SVM System for Speaker Verification Task
Zergat, Kawthar Yasmine
Amrouche, Abderrahmane
Asbai, Nassim
Debyeche, Mohamed
8TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS 2012), 2012, : 214 - 217
[22] Gender Identification of a Speaker Using MFCC and GMM
Yucesoy, Ergun
Nabiyev, Vasif V.
2013 8TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO), 2013, : 626 - 629
[23] A hybrid GMM/SVM approach to speaker identification
Fine, S
Navrátil, J
Gopinath, RA
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS, 2001, : 417 - 420
[24] Efficient speaker identification based on robust VQ-PCA
Lee, Y
Lee, J
Lee, KY
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2003, PT 2, PROCEEDINGS, 2003, 2668 : 631 - 638
[25] Speaker Cluster based GMM Tokenization for Speaker Recognition
Ma, Bin
Zhu, Donglai
Tong, Rong
Li, Haizhou
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 505 - 508
[26] Dimension Compactness in Speaker Identification
Kanrar, Soumen
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
[27] Closed-set speaker identification using VQ and GMM based models
Barai, Bidhan
Chakraborty, Tapas
Das, Nibaran
Basu, Subhadip
Nasipuri, Mita
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 173 - 196
[28] Robust Speaker Identification Based On Hybrid Model of VQ and GMM-UBM
Nguyen, Vu X.
Nguyen, Vu P. H.
Pham, Tuan V.
2015 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2015, : 490 - 495
[30] A hybrid GMM-SVM speaker identification system
Mashao, DJ
2004 IEEE AFRICON: 7TH AFRICON CONFERENCE IN AFRICA, VOLS 1 AND 2: TECHNOLOGY INNOVATION, 2004, : 319 - 322
