Speaker recognition using Gaussian mixture models

被引:0
|
作者
Kamarauskas, J. [1 ]
机构
[1] Inst Math & Informat, LT-01108 Vilnius, Lithuania
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Gaussian Mixture models is one of the most popular statistical methods in speaker recognition. The purpose of this research is to perform experiments of speaker recognition using various feature vectors: four formants, four formants with fundamental frequency and mel cepstrum coefficients. Gaussian mixture models using mel cepstrum coefficients is baseline in speaker recognition and gives one of the best results in text independent speaker recognition. After implementing experiments of speaker recognition and comparing experimental results we can affirm that mel scale cepstral coefficients and four formants with fundamental frequency gives quite the same recognition accuracy, but creating of Gaussian mixture speaker models and recognition process continues a few times longer using mel scale cepstral coefficients, because count of calculations is few times greater in that case. Using only four formants gives the worst results of recognition accuracy.
引用
收藏
页码:29 / 32
页数:4
相关论文
共 50 条
  • [1] Speaker recognition for VoIP transmission using Gaussian mixture models
    Staroniewicz, P
    [J]. COMPUTER RECOGNITION SYSTEMS, PROCEEDINGS, 2005, : 739 - 745
  • [2] Skew Gaussian mixture models for speaker recognition
    Matza, Avi
    Bistritz, Yuval
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 12 - 15
  • [3] Skew Gaussian mixture models for speaker recognition
    Matza, Avi
    Bistritz, Yuval
    [J]. IET SIGNAL PROCESSING, 2014, 8 (08) : 860 - 867
  • [4] EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION
    Motlicek, Petr
    Dey, Subhadeep
    Madikeri, Srikanth
    Burget, Lukas
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4445 - 4449
  • [5] SPEAKER IDENTIFICATION AND VERIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS
    REYNOLDS, DA
    [J]. SPEECH COMMUNICATION, 1995, 17 (1-2) : 91 - 108
  • [6] Combining Gaussian mixture models and segmental feature models for speaker recognition
    Milosevic, Milana
    Glavitsch, Ulrike
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2042 - 2043
  • [7] Automatic speaker recognition using a unique personal feature vector and Gaussian Mixture Models
    Kaminski, Kamil
    Majda, Ewelina
    Dobrowolski, Andrzej P.
    [J]. 2013 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2013, : 220 - 225
  • [8] Telephone based speaker recognition using multiple binary classifier and Gaussian Mixture Models
    Castellano, PJ
    Slomka, S
    Sridharan, S
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1075 - 1078
  • [9] Improved Approach for Calculating Model Parameters in Speaker Recognition using Gaussian Mixture Models
    Metkar, Prashant
    Cohen, Aaron
    Parhi, Keshab
    [J]. 2010 CONFERENCE RECORD OF THE FORTY FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2010, : 567 - 570
  • [10] Speaker verification using adapted Gaussian mixture models
    Reynolds, DA
    Quatieri, TF
    Dunn, RB
    [J]. DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 19 - 41