Transformation-based GMM with improved cluster algorithm for speaker identification

被引:0
|
作者
Xu, Limin [1 ]
Tang, Zhenmin [1 ]
He, Keke [1 ]
Qian, Bo [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China
关键词
gaussian mixture model (GMM); improved cluster algorithm; linear transformation; expectation-maximization (EM) algorithm;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The embedded linear transformation is a popular technique which integrates both transformation and diagonal-covariance Gaussian mixture into a unified framework to improve the performance of speaker recognition. However, the mixture number of GMM must be given in model training. The cluster expectation-maximization (EM) algorithm is a well-known technique in which the mixture number is regarded as an estimated parameter. This paper presents a new model that integrates an improved cluster algorithm into the estimating process of GMM with the embedded transformation. In the approach, the transformation matrix, the mixture number and other traditional model parameters are simultaneously estimated according to a maximum likelihood criterion. The proposed method is demonstrated on a database of three data sessions for text independent speaker identification. The experiments show that this method outperforms the traditional GMM with cluster EM algorithm.
引用
收藏
页码:1006 / +
页数:3
相关论文
共 50 条
  • [31] Color Space Transformation-Based Smartphone Algorithm for Colorimetric Urinalysis
    Yang, Renbing
    Cheng, Wenbo
    Chen, Xifeng
    Qian, Qin
    Zhang, Qiang
    Pan, Yujun
    Duan, Peng
    Miao, Peng
    ACS OMEGA, 2018, 3 (09): : 12141 - 12146
  • [32] An Improved Transformation-Based Kernel Estimator of Densities on the Unit Interval
    Wen, Kuangyu
    Wu, Ximing
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (510) : 773 - 783
  • [33] Real Number Laplace Transformation-based Identification and its Application
    Suzuki, Satoshi
    Furuta, Katsuhisa
    2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 2048 - 2053
  • [34] Transformation-based estimation
    Feng, Zhenghui
    Wang, Tao
    Zhu, Lixing
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 78 : 186 - 205
  • [35] Improved GMM-UBM/SVM for speaker verification
    Liu, Minghui
    Dai, Beiqian
    Xie, Yanlu
    Yao, Zhiqiang
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 925 - 928
  • [36] Closed-set speaker identification using VQ and GMM based models
    Barai, Bidhan
    Chakraborty, Tapas
    Das, Nibaran
    Basu, Subhadip
    Nasipuri, Mita
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 173 - 196
  • [37] Robust Speaker Identification Based On Hybrid Model of VQ and GMM-UBM
    Nguyen, Vu X.
    Nguyen, Vu P. H.
    Pham, Tuan V.
    2015 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2015, : 490 - 495
  • [38] Improved speaker based speech segmentation algorithm
    Lu, Jian
    Mao, Bing
    Sun, Zheng-Xing
    Zhang, Fu-Yan
    Ruan Jian Xue Bao/Journal of Software, 2002, 13 (02): : 274 - 279
  • [39] Closed-set speaker identification using VQ and GMM based models
    Bidhan Barai
    Tapas Chakraborty
    Nibaran Das
    Subhadip Basu
    Mita Nasipuri
    International Journal of Speech Technology, 2022, 25 : 173 - 196
  • [40] An improved transformation-based kernel estimator for population abundance with shoulder condition
    Albadareen, Baker
    Ismail, Noriszura
    Jaber, Jamil J.
    ITALIAN JOURNAL OF PURE AND APPLIED MATHEMATICS, 2021, (46): : 370 - 381