Enhancing the Performance of Gaussian Mixture Model-Based Text Independent Speaker Identification

Cited by: 0

Authors:

El-Gamal, M. A. [1]

Abu El-Yazeed, M. F. [2]

El Ayadi, M. M. H. [3]

Affiliations:

[1] Cairo Univ, Fac Engn, Dept Engn Phys & Math, Giza, Egypt

[2] Cairo Univ, Fac Engn, Dept Elect & Comm, Giza, Egypt

[3] Cairo Univ, Dept Eng Phys & Math, Giza, Egypt

Source:

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY | 2005 / Vol. 8 / No. 01

Keywords:

Gaussian mixture model; goodness of fit; minimum description length; Akaike information criterion; linear discriminant analysis; text-independent speaker identification;

DOI:

10.1007/s10772-005-4764-8

Chinese Library Classification:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Discipline classification codes:

0808; 0809;

Abstract:

In this paper, we seek to enhance the identification performance of Gaussian Mixture Model (GMM)-based speaker identification systems in the presence of a limited amount of training data and a relatively large number of speakers. The performance is characterized by the identification accuracy, the identification time, and the model complexity. A new model order selection technique based on the Goodness of Fit (GOF) statistical test is proposed in order to increase the identification accuracy. This technique has been shown to outperform other well-known model order selection techniques, such as the Minimum Description Length (MDL) and the Akaike Information Criterion (AIC), in terms of identification accuracy and robustness against telephone channel degradation effects. In addition, the identification time is decreased by adapting the Linear Discriminant Analysis (LDA) feature extraction technique to fit our basic assumption of an asymmetric multimodal distribution of the training data of each speaker. This modification results in a large decrease in the identification time with little effect on the identification accuracy.
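For readers unfamiliar with the two baseline criteria named in the abstract, the following is a minimal sketch of AIC- and MDL-based model order selection for a GMM. It does not implement the paper's GOF technique, which the abstract does not describe. The diagonal-covariance parameter count, the function names, and the log-likelihood values are illustrative assumptions, not taken from the paper.

```python
import math

def gmm_param_count(n_components, dim):
    """Free parameters of a diagonal-covariance GMM (an assumption here)."""
    # means + variances, plus mixture weights (they sum to 1, so one fewer)
    return 2 * n_components * dim + (n_components - 1)

def aic(log_likelihood, n_params):
    # Akaike Information Criterion: lower is better
    return -2.0 * log_likelihood + 2.0 * n_params

def mdl(log_likelihood, n_params, n_samples):
    # Minimum Description Length (two-part code form): lower is better
    return -log_likelihood + 0.5 * n_params * math.log(n_samples)

def select_order(log_likelihoods, dim, n_samples, criterion="mdl"):
    """Pick the model order with the lowest criterion value.

    log_likelihoods maps a candidate order (number of mixture components)
    to the total training log-likelihood of a GMM fitted at that order.
    """
    scores = {}
    for order, ll in log_likelihoods.items():
        k = gmm_param_count(order, dim)
        scores[order] = aic(ll, k) if criterion == "aic" else mdl(ll, k, n_samples)
    return min(scores, key=scores.get), scores

# Hypothetical log-likelihoods for 1000 training vectors of dimension 12.
example_lls = {4: -21000.0, 8: -20500.0, 16: -20250.0}
best_aic, _ = select_order(example_lls, dim=12, n_samples=1000, criterion="aic")
best_mdl, _ = select_order(example_lls, dim=12, n_samples=1000, criterion="mdl")
```

With these made-up numbers the two criteria disagree, since MDL penalizes each extra parameter more heavily than AIC once the training set has more than a handful of vectors. That sensitivity to limited training data is the setting the abstract targets.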


Pages: 93-103

Page count: 11
