Experimental Study on GMM-Based Speaker Recognition

被引:1
|
作者
Ye, Wenxing [1 ]
Wu, Dapeng [1 ]
Nucci, Antonio [2 ]
机构
[1] Univ Florida, Gainesville, FL 32611 USA
[2] Narus Inc, Sunnyvale, CA 94085 USA
关键词
Speaker recognition; GMM; MFCC;
D O I
10.1117/12.849201
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Speaker recognition plays a very important role in the field of biometric security. In order to improve the recognition performance, many pattern recognition techniques have be explored in the literature. Among these techniques, the Gaussian Mixture Model (GMM) is proved to be an effective statistic model for speaker recognition and is used in most state-of-the-art speaker recognition systems. The GMM is used to represent the 'voice print' of a speaker through modeling the spectral characteristic of speech signals of the speaker. In this paper, we implement a speaker recognition system, which consists of preprocessing, Mel-Frequency Cepstrum Coefficients (MFCCs) based feature extraction, and GMM based classification. We test our system with TIDIGITS data set (325 speakers) and our own recordings of more than 200 speakers; our system achieves 100% correct recognition rate. Moreover, we also test our system under the scenario that training samples are from one language but test samples are from a different language; our system also achieves 100% correct recognition rate, which indicates that our system is language independent.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] A new common component GMM-based speaker recognition method
    Wang, YR
    Chiang, CY
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 645 - 648
  • [2] GMM-based Bhattacharyya kernel Fisher Discriminant Analysis for speaker recognition
    Chao, YH
    Wang, HM
    Chang, RC
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 649 - 652
  • [3] Speaker and session variability in GMM-based speaker verification
    Kenny, Patrick
    Boulianne, Gilles
    Ouellet, Pierre
    Dumouchel, Pierre
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1448 - 1460
  • [4] FPGA Implementation for GMM-Based Speaker Identification
    EhKan, Phaklen
    Allen, Timothy
    Quigley, Steven F.
    [J]. INTERNATIONAL JOURNAL OF RECONFIGURABLE COMPUTING, 2011, 2011
  • [5] Quantization for adapted GMM-based speaker verification
    Tseng, Ivy H.
    Verscheure, Olivier
    Turaga, Deepak S.
    Chaudhari, Upendra V.
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 653 - 656
  • [6] A GMM-Based Speaker Identification System on FPGA
    Kan, Phak Len Eh
    Allen, Tim
    Quigley, Steven F.
    [J]. RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2010, 5992 : 358 - 363
  • [7] GMM-based SVM for face recognition
    Bredin, Herve
    Dehak, Najim
    Chollet, Gerard
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS, 2006, : 1111 - +
  • [8] A GMM-based Probabilistic Sequence Kernel for Speaker Verification
    Lee, Kong-Aik
    You, Changhuai
    Li, Haizhou
    Kinnunen, Tomi
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1553 - 1556
  • [9] Evaluation of GMM-based Features for SVM Speaker Verification
    Liu, Minghui
    Huang, Zhongwei
    [J]. 2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 5027 - 5030
  • [10] An Improved GMM-based Clustering Algorithm for Efficient Speaker Identification
    Lin, Wenyong
    [J]. PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1490 - 1493