Experimental Study on GMM-Based Speaker Recognition

Cited by: 1

Authors:

Ye, Wenxing [1]

Wu, Dapeng [1]

Nucci, Antonio [2]

Affiliations:

[1] Univ Florida, Gainesville, FL 32611 USA

[2] Narus Inc, Sunnyvale, CA 94085 USA

Source:

MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2010 | 2010 / Vol. 7708

Keywords:

Speaker recognition; GMM; MFCC

DOI:

10.1117/12.849201

Chinese Library Classification:

TP301 [Theory, methods]

Discipline classification code:

081202

Abstract:

Speaker recognition plays an important role in biometric security. To improve recognition performance, many pattern recognition techniques have been explored in the literature. Among these techniques, the Gaussian Mixture Model (GMM) has proved to be an effective statistical model for speaker recognition and is used in most state-of-the-art speaker recognition systems. The GMM represents the 'voice print' of a speaker by modeling the spectral characteristics of the speaker's speech signals. In this paper, we implement a speaker recognition system that consists of preprocessing, Mel-Frequency Cepstrum Coefficients (MFCCs) based feature extraction, and GMM-based classification. We test our system on the TIDIGITS data set (325 speakers) and on our own recordings of more than 200 speakers; the system achieves a 100% correct recognition rate. We also test the system in a scenario where the training samples are in one language and the test samples are in a different language; the system again achieves a 100% correct recognition rate, which indicates that it is language independent.
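The classification stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes diagonal-covariance GMMs trained with plain EM, two mixture components per speaker, and synthetic 12-dimensional vectors standing in for MFCC frames (no audio preprocessing or MFCC extraction is performed). The names `fit_gmm`, `identify`, `fake_frames`, and the speaker labels are hypothetical. One GMM is trained per speaker, and a test utterance is assigned to the speaker whose model gives the highest average log-likelihood.

```python
import numpy as np

def logsumexp(a, axis):
    """Numerically stable log(sum(exp(a))) along one axis."""
    m = a.max(axis=axis, keepdims=True)
    return (m + np.log(np.exp(a - m).sum(axis=axis, keepdims=True))).squeeze(axis)

def component_log_probs(X, gmm):
    """log(weight_k * N(x | mean_k, diag(var_k))) for every frame and component."""
    weights, means, variances = gmm
    diff = X[:, None, :] - means[None, :, :]                  # (frames, K, dims)
    log_norm = -0.5 * np.log(2 * np.pi * variances).sum(axis=1)
    log_exp = -0.5 * (diff ** 2 / variances[None, :, :]).sum(axis=2)
    return np.log(weights)[None, :] + log_norm[None, :] + log_exp

def fit_gmm(X, n_components=2, n_iter=30, seed=0):
    """Fit a diagonal-covariance GMM with EM (assumed settings, not from the paper)."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    means = X[rng.choice(n, n_components, replace=False)].copy()
    variances = np.tile(X.var(axis=0) + 1e-6, (n_components, 1))
    weights = np.full(n_components, 1.0 / n_components)
    for _ in range(n_iter):
        # E-step: responsibility of each component for each frame.
        log_p = component_log_probs(X, (weights, means, variances))
        resp = np.exp(log_p - logsumexp(log_p, axis=1)[:, None])
        # M-step: re-estimate weights, means, and variances.
        nk = resp.sum(axis=0) + 1e-10
        weights = nk / n
        means = (resp.T @ X) / nk[:, None]
        variances = np.maximum((resp.T @ X ** 2) / nk[:, None] - means ** 2, 1e-6)
    return weights, means, variances

def avg_log_likelihood(X, gmm):
    """Average per-frame log-likelihood of an utterance under one speaker model."""
    return float(logsumexp(component_log_probs(X, gmm), axis=1).mean())

def identify(X, models):
    """Return the speaker whose GMM scores the utterance highest."""
    return max(models, key=lambda name: avg_log_likelihood(X, models[name]))

# Synthetic stand-in for MFCC frames: each hypothetical speaker has its own center.
rng = np.random.default_rng(0)
centers = {"spk_a": 0.0, "spk_b": 3.0, "spk_c": -3.0}

def fake_frames(center, n_frames):
    return rng.normal(loc=center, scale=1.0, size=(n_frames, 12))

models = {name: fit_gmm(fake_frames(c, 400)) for name, c in centers.items()}
predictions = {name: identify(fake_frames(c, 100), models) for name, c in centers.items()}
print(predictions)
```

The synthetic speakers here are far apart in feature space, so correct identification in this toy setting says nothing about performance on real speech; the sketch only shows the train-one-model-per-speaker, score-by-likelihood structure.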

Pages: 9
