Transformation-based GMM with improved cluster algorithm for speaker identification

被引:0
|
作者
Xu, Limin [1 ]
Tang, Zhenmin [1 ]
He, Keke [1 ]
Qian, Bo [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China
关键词
gaussian mixture model (GMM); improved cluster algorithm; linear transformation; expectation-maximization (EM) algorithm;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The embedded linear transformation is a popular technique which integrates both transformation and diagonal-covariance Gaussian mixture into a unified framework to improve the performance of speaker recognition. However, the mixture number of GMM must be given in model training. The cluster expectation-maximization (EM) algorithm is a well-known technique in which the mixture number is regarded as an estimated parameter. This paper presents a new model that integrates an improved cluster algorithm into the estimating process of GMM with the embedded transformation. In the approach, the transformation matrix, the mixture number and other traditional model parameters are simultaneously estimated according to a maximum likelihood criterion. The proposed method is demonstrated on a database of three data sessions for text independent speaker identification. The experiments show that this method outperforms the traditional GMM with cluster EM algorithm.
引用
收藏
页码:1006 / +
页数:3
相关论文
共 50 条
  • [41] A hybrid GMM-SVM speaker identification system
    Mashao, DJ
    2004 IEEE AFRICON: 7TH AFRICON CONFERENCE IN AFRICA, VOLS 1 AND 2: TECHNOLOGY INNOVATION, 2004, : 319 - 322
  • [42] Novel Approach in Speaker Identification using SVM and GMM
    Bourouba, H.
    Korba, C. A.
    Djemili, Rafik
    CONTROL ENGINEERING AND APPLIED INFORMATICS, 2013, 15 (03): : 87 - 95
  • [43] Directly modeling of correlation matrices for GMM in speaker identification
    Yao, Zhiqiang
    Zhou, Xi
    Dai, Beiqian
    Liu, Minghui
    Xie, Yanlu
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 306 - +
  • [44] Hybrid KLT/GMM approach for robust speaker identification
    Chen, CCT
    Chen, CT
    Cheng, PW
    ELECTRONICS LETTERS, 2003, 39 (21) : 1552 - 1554
  • [45] A Fuzzy-GMM Classifier For Multilingual Speaker Identification
    Devika, A. K.
    Sumithra, M. G.
    Deepika, A. K.
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [46] Speaker and session variability in GMM-based speaker verification
    Kenny, Patrick
    Boulianne, Gilles
    Ouellet, Pierre
    Dumouchel, Pierre
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1448 - 1460
  • [47] Symplectic Geometry Transformation-Based Periodic Segment Method: Algorithm and Applications
    Pan, Haiyang
    Zhang, Ying
    Cheng, Jian
    Zheng, Jinde
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [48] Enhancing GMM speaker identification by incorporating SVM speaker verification for intelligent web-based speech applications
    Ing-Jr Ding
    Chih-Ta Yen
    Multimedia Tools and Applications, 2015, 74 : 5131 - 5140
  • [49] A preliminary study on GMM weight transformation for Emotional Speaker Recognition
    Chen, Li
    Yang, Yingchun
    2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 31 - 36
  • [50] Enhancing GMM speaker identification by incorporating SVM speaker verification for intelligent web-based speech applications
    Ding, Ing-Jr
    Yen, Chih-Ta
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (14) : 5131 - 5140