Enhancing the Performance of Gaussian Mixture Model-Based Text Independent Speaker Identification

Cited by: 0
Authors
El-Gamal, M. A. [1]
Abu El-Yazeed, M. F. [2]
El Ayadi, M. M. H. [3]
Affiliations
[1] Cairo Univ, Fac Engn, Dept Engn Phys & Math, Giza, Egypt
[2] Cairo Univ, Fac Engn, Dept Elect & Comm, Giza, Egypt
[3] Cairo Univ, Dept Eng Phys & Math, Giza, Egypt
Keywords
Gaussian mixture model; goodness of fit; minimum description length; Akaike information criterion; linear discriminant analysis; text-independent speaker identification
DOI
10.1007/s10772-005-4764-8
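Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract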
In this paper, we seek to enhance the performance of Gaussian Mixture Model (GMM)-based speaker identification systems when the amount of training data is limited and the number of speakers is relatively large. Performance is characterized by the identification accuracy, the identification time, and the model complexity. To increase the identification accuracy, a new model order selection technique based on the Goodness of Fit (GOF) statistical test is proposed. This technique has been shown to outperform other well-known model order selection techniques, such as the Minimum Description Length (MDL) and the Akaike Information Criterion (AIC), in terms of identification accuracy and robustness against telephone channel degradation effects. In addition, the identification time is decreased by adapting the Linear Discriminant Analysis (LDA) feature extraction technique to fit our basic assumption of an asymmetric multimodal distribution of each speaker's training data. This modification results in a large decrease in the identification time with little effect on the identification accuracy.
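As a rough illustration of the baseline criteria named in the abstract, the sketch below scores candidate GMM orders with AIC and a two-part MDL on toy data. It is not the authors' method: the GOF-based technique is not sketched, because the abstract does not specify it. Everything in the example is an assumption made for illustration, including the 1-D synthetic data (real systems use multi-dimensional speech features), the quantile-based EM initialisation, the variance floor, the MDL form -log L + (p/2) log n, and the helper names `fit_gmm_1d`, `order_scores`, and `make_toy_data`.

```python
import math
import random


def log_likelihood(data, weights, means, variances):
    """Total log-likelihood of 1-D data under a Gaussian mixture."""
    total = 0.0
    for x in data:
        density = sum(
            w * math.exp(-0.5 * (x - m) ** 2 / v) / math.sqrt(2.0 * math.pi * v)
            for w, m, v in zip(weights, means, variances)
        )
        total += math.log(max(density, 1e-300))  # guard against underflow
    return total


def fit_gmm_1d(data, k, iters=60):
    """Fit a k-component 1-D GMM with plain EM (toy initialisation)."""
    n = len(data)
    ordered = sorted(data)
    mean_all = sum(data) / n
    var_all = sum((x - mean_all) ** 2 for x in data) / n
    weights = [1.0 / k] * k
    means = [ordered[int((j + 0.5) * n / k)] for j in range(k)]  # spread-out quantiles
    variances = [var_all] * k
    for _ in range(iters):
        # E-step: responsibility of each component for each sample.
        resp = []
        for x in data:
            dens = [
                w * math.exp(-0.5 * (x - m) ** 2 / v) / math.sqrt(2.0 * math.pi * v)
                for w, m, v in zip(weights, means, variances)
            ]
            norm = max(sum(dens), 1e-300)
            resp.append([d / norm for d in dens])
        # M-step: re-estimate weights, means, and variances.
        for j in range(k):
            nj = max(sum(r[j] for r in resp), 1e-12)
            weights[j] = nj / n
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            var_j = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nj
            variances[j] = max(var_j, 1e-3)  # floor avoids degenerate components
    return weights, means, variances


def order_scores(data, max_k):
    """Return {k: (aic, mdl)} for model orders k = 1..max_k."""
    n = len(data)
    scores = {}
    for k in range(1, max_k + 1):
        params = fit_gmm_1d(data, k)
        ll = log_likelihood(data, *params)
        p = 3 * k - 1  # (k - 1) free weights + k means + k variances
        aic = -2.0 * ll + 2.0 * p
        mdl = -ll + 0.5 * p * math.log(n)  # assumed two-part MDL form
        scores[k] = (aic, mdl)
    return scores


def make_toy_data(seed=0, n_per_mode=200):
    """Hypothetical stand-in for one speaker's features: two 1-D modes."""
    rng = random.Random(seed)
    return ([rng.gauss(-5.0, 1.0) for _ in range(n_per_mode)]
            + [rng.gauss(5.0, 1.0) for _ in range(n_per_mode)])


if __name__ == "__main__":
    scores = order_scores(make_toy_data(), max_k=4)
    for k, (aic, mdl) in scores.items():
        print(f"k={k}  AIC={aic:.1f}  MDL={mdl:.1f}")
    print("AIC picks k =", min(scores, key=lambda k: scores[k][0]))
    print("MDL picks k =", min(scores, key=lambda k: scores[k][1]))
```

Both criteria trade the fitted log-likelihood against the number of free parameters. The MDL penalty grows with the sample size, so with more data it tends to favour smaller models than AIC does.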
Pages: 93-103
Page count: 11