Transformation-based GMM with improved cluster algorithm for speaker identification

被引:0
|
作者
Xu, Limin [1 ]
Tang, Zhenmin [1 ]
He, Keke [1 ]
Qian, Bo [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China
关键词
gaussian mixture model (GMM); improved cluster algorithm; linear transformation; expectation-maximization (EM) algorithm;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The embedded linear transformation is a popular technique which integrates both transformation and diagonal-covariance Gaussian mixture into a unified framework to improve the performance of speaker recognition. However, the mixture number of GMM must be given in model training. The cluster expectation-maximization (EM) algorithm is a well-known technique in which the mixture number is regarded as an estimated parameter. This paper presents a new model that integrates an improved cluster algorithm into the estimating process of GMM with the embedded transformation. In the approach, the transformation matrix, the mixture number and other traditional model parameters are simultaneously estimated according to a maximum likelihood criterion. The proposed method is demonstrated on a database of three data sessions for text independent speaker identification. The experiments show that this method outperforms the traditional GMM with cluster EM algorithm.
引用
收藏
页码:1006 / +
页数:3
相关论文
共 50 条
  • [1] Speaker identification using multi-step clustering algorithm with transformation-based GMM
    Xu L.
    Tang Z.
    Automatic Control and Computer Sciences, 2007, 41 (04) : 224 - 231
  • [2] An Improved GMM-based Clustering Algorithm for Efficient Speaker Identification
    Lin, Wenyong
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1490 - 1493
  • [3] Speaker Cluster based GMM Tokenization for Speaker Recognition
    Ma, Bin
    Zhu, Donglai
    Tong, Rong
    Li, Haizhou
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 505 - 508
  • [4] Research on Adaptive Speaker Identification Based on GMM
    Zhou, Yuhuan
    Wang, Jinming
    Zhang, Xiongwei
    2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 2, PROCEEDINGS, 2009, : 330 - 332
  • [5] GMM based on local PCA for speaker identification
    Seo, CW
    Lee, KY
    Lee, J
    ELECTRONICS LETTERS, 2001, 37 (24) : 1486 - 1488
  • [6] Speaker identification based on GMM with embedded AANN
    Chen C.-B.
    Zhao L.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2010, 32 (03): : 528 - 532
  • [7] Improved transformation-based quantile regression
    Geraci, Marco
    Jones, M. C.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2015, 43 (01): : 118 - 132
  • [8] Speaker identification based on GMM feature transformation and fuzzy least-squares SVM
    Li, Yanping
    Tang, Zhenmin
    Ding, Hui
    Zhang, Yan
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2008, 36 (SUPPL. 1): : 31 - 34
  • [9] Natural-Emotion GMM Transformation Algorithm for Emotional Speaker Recognition
    Shan, Zhenyu
    Yang, Yingchun
    Ruizhi, Ye
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1629 - 1632
  • [10] A GMM-Based Speaker Identification System on FPGA
    Kan, Phak Len Eh
    Allen, Tim
    Quigley, Steven F.
    RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2010, 5992 : 358 - 363