Transformation-based GMM with improved cluster algorithm for speaker identification

被引:0
|
作者
Xu, Limin [1 ]
Tang, Zhenmin [1 ]
He, Keke [1 ]
Qian, Bo [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci, Nanjing, Peoples R China
关键词
gaussian mixture model (GMM); improved cluster algorithm; linear transformation; expectation-maximization (EM) algorithm;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The embedded linear transformation is a popular technique which integrates both transformation and diagonal-covariance Gaussian mixture into a unified framework to improve the performance of speaker recognition. However, the mixture number of GMM must be given in model training. The cluster expectation-maximization (EM) algorithm is a well-known technique in which the mixture number is regarded as an estimated parameter. This paper presents a new model that integrates an improved cluster algorithm into the estimating process of GMM with the embedded transformation. In the approach, the transformation matrix, the mixture number and other traditional model parameters are simultaneously estimated according to a maximum likelihood criterion. The proposed method is demonstrated on a database of three data sessions for text independent speaker identification. The experiments show that this method outperforms the traditional GMM with cluster EM algorithm.
引用
收藏
页码:1006 / +
页数:3
相关论文
共 50 条
  • [21] EM Algorithm with Initialization Based on Incremental k-means for GMM and Its Application to Speaker Identification
    Lee, Younjeong
    Seo, Changwoo
    Hahn, Hernsoo
    Lee, Kiyong
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2005, 24 (03): : 141 - 149
  • [22] A Novel Transformation-Based Algorithm for Reversible Logic Synthesis
    Wan, Sishuang
    Chen, Hanwu
    Cao, Rujin
    ADVANCES IN COMPUTATION AND INTELLIGENCE, PROCEEDINGS, 2009, 5821 : 70 - 81
  • [23] A Fast Transformation-Based Synthesis Algorithm for Reversible Circuits
    Ardestani, Ehsan K.
    Zamani, Morteza Saheb
    Sedighi, Mehdi
    11TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN - ARCHITECTURES, METHODS AND TOOLS : DSD 2008, PROCEEDINGS, 2008, : 803 - 806
  • [24] A Novel Coordinate Transformation-Based Texture Mapping Algorithm
    Zhao, Yue
    Cui, Xiaoyu
    Wang, Zhiqiong
    Yin, Ziming
    Feng, Cong
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 2104 - 2107
  • [25] Voice Transformation-based Spoofing of Text-Dependent Speaker Verification Systems
    Kons, Zvi
    Aronowitz, Hagai
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 945 - 949
  • [26] Text and Language Independent Speaker Identification by GMM based i Vector
    Kanrar, Soumen
    Jaiswal, Naveen
    6TH INTERNATIONAL CONFERENCE ON COMPUTER & COMMUNICATION TECHNOLOGY (ICCCT-2015), 2015, : 95 - 100
  • [27] Local fuzzy PCA based GMM with dimension reduction on speaker identification
    Lee, KY
    PATTERN RECOGNITION LETTERS, 2004, 25 (16) : 1811 - 1817
  • [28] Gender Identification of a Speaker Using MFCC and GMM
    Yucesoy, Ergun
    Nabiyev, Vasif V.
    2013 8TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO), 2013, : 626 - 629
  • [29] A hybrid GMM/SVM approach to speaker identification
    Fine, S
    Navrátil, J
    Gopinath, RA
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 417 - 420
  • [30] Speaker Transformation Using Length-variable Moving Window Based GMM
    Kang, Guangyu
    Guo, Shize
    HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 1, PROCEEDINGS, 2009, : 308 - +