Transformation-based GMM with improved cluster algorithm for speaker identification

被引：0

作者：

Xu, Limin ^{[1
]}

Tang, Zhenmin ^{[1
]}

He, Keke ^{[1
]}

Qian, Bo ^{[1
]}

机构：

[1] Nanjing Univ Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China

来源：

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS | 2007年 / 4426卷

关键词：

gaussian mixture model (GMM); improved cluster algorithm; linear transformation; expectation-maximization (EM) algorithm;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The embedded linear transformation is a popular technique which integrates both transformation and diagonal-covariance Gaussian mixture into a unified framework to improve the performance of speaker recognition. However, the mixture number of GMM must be given in model training. The cluster expectation-maximization (EM) algorithm is a well-known technique in which the mixture number is regarded as an estimated parameter. This paper presents a new model that integrates an improved cluster algorithm into the estimating process of GMM with the embedded transformation. In the approach, the transformation matrix, the mixture number and other traditional model parameters are simultaneously estimated according to a maximum likelihood criterion. The proposed method is demonstrated on a database of three data sessions for text independent speaker identification. The experiments show that this method outperforms the traditional GMM with cluster EM algorithm.

引用

页码：1006 / +

页数：3

共 50 条

[1] Speaker identification using multi-step clustering algorithm with transformation-based GMM
Xu L.
Tang Z.
Automatic Control and Computer Sciences, 2007, 41 (04) : 224 - 231
[2] An Improved GMM-based Clustering Algorithm for Efficient Speaker Identification
Lin, Wenyong
PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1490 - 1493
[3] Speaker Cluster based GMM Tokenization for Speaker Recognition
Ma, Bin
Zhu, Donglai
Tong, Rong
Li, Haizhou
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 505 - 508
[4] Research on Adaptive Speaker Identification Based on GMM
Zhou, Yuhuan
Wang, Jinming
Zhang, Xiongwei
2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 2, PROCEEDINGS, 2009, : 330 - 332
[5] GMM based on local PCA for speaker identification
Seo, CW
Lee, KY
Lee, J
ELECTRONICS LETTERS, 2001, 37 (24) : 1486 - 1488
[6] Speaker identification based on GMM with embedded AANN
Chen C.-B.
Zhao L.
Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2010, 32 (03): : 528 - 532
[7] Improved transformation-based quantile regression
Geraci, Marco
Jones, M. C.
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2015, 43 (01): : 118 - 132
[8] Speaker identification based on GMM feature transformation and fuzzy least-squares SVM
Li, Yanping
Tang, Zhenmin
Ding, Hui
Zhang, Yan
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2008, 36 (SUPPL. 1): : 31 - 34
[9] Natural-Emotion GMM Transformation Algorithm for Emotional Speaker Recognition
Shan, Zhenyu
Yang, Yingchun
Ruizhi, Ye
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1629 - 1632
[10] A GMM-Based Speaker Identification System on FPGA
Kan, Phak Len Eh
Allen, Tim
Quigley, Steven F.
RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2010, 5992 : 358 - 363

← 1 2 3 4 5 →