Experimental Study on GMM-Based Speaker Recognition

被引:1
|
作者
Ye, Wenxing [1 ]
Wu, Dapeng [1 ]
Nucci, Antonio [2 ]
机构
[1] Univ Florida, Gainesville, FL 32611 USA
[2] Narus Inc, Sunnyvale, CA 94085 USA
关键词
Speaker recognition; GMM; MFCC;
D O I
10.1117/12.849201
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Speaker recognition plays a very important role in the field of biometric security. In order to improve the recognition performance, many pattern recognition techniques have be explored in the literature. Among these techniques, the Gaussian Mixture Model (GMM) is proved to be an effective statistic model for speaker recognition and is used in most state-of-the-art speaker recognition systems. The GMM is used to represent the 'voice print' of a speaker through modeling the spectral characteristic of speech signals of the speaker. In this paper, we implement a speaker recognition system, which consists of preprocessing, Mel-Frequency Cepstrum Coefficients (MFCCs) based feature extraction, and GMM based classification. We test our system with TIDIGITS data set (325 speakers) and our own recordings of more than 200 speakers; our system achieves 100% correct recognition rate. Moreover, we also test our system under the scenario that training samples are from one language but test samples are from a different language; our system also achieves 100% correct recognition rate, which indicates that our system is language independent.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] A Study of Mutual Information for GMM-Based Spectral Conversion
    Hwang, Hsin-Te
    Tsao, Yu
    Wang, Hsin-Min
    Wang, Yih-Ru
    Chen, Sin-Horng
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 78 - 81
  • [32] On the determination of optimal model order for GMM-based text-independent speaker identification
    Abu El-Yazeed, MF
    El Gamal, MA
    El Ayadi, MMH
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (08) : 1078 - 1087
  • [33] Enhancing the Performance of a GMM-based Speaker Identification System in a Multi-Microphone Setup
    Stergiou, Andreas
    Pnevmatikakis, Aristodemos
    Polymenakos, Lazaros C.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1463 - 1466
  • [34] On the Determination of Optimal Model Order for GMM-Based Text-Independent Speaker Identification
    MF Abu El-Yazeed
    MA El Gamal
    MMH El Ayadi
    [J]. EURASIP Journal on Advances in Signal Processing, 2004
  • [35] Data-driven Gaussian Component Selection for Fast GMM-Based Speaker Verification
    Zhang, Ce
    Zheng, Rong
    Xu, Bo
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 252 - 255
  • [36] Improved GMM-based language recognition using constrained MLLR transforms
    Shen, Wade
    Reynolds, Douglas
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4149 - 4152
  • [37] Improved GMM-based Speaker Verification Using SVM-Driven Impostor Dataset Selection
    McLaren, Mitchell
    Vogt, Robbie
    Baker, Brendan
    Sridharan, Sridha
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1271 - 1274
  • [38] Reassigned spectrum-based feature extraction for GMM-based automatic chord recognition
    Maksim Khadkevich
    Maurizio Omologo
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2013
  • [39] GMM-based classification of genomic sequences
    Akhtar, Mahmood
    Ambikairajah, Eliathamby
    Epps, Julien
    [J]. PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 103 - +
  • [40] Reassigned spectrum-based feature extraction for GMM-based automatic chord recognition
    Khadkevich, Maksim
    Omologo, Maurizio
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,