A NOVEL I-VECTOR FRAMEWORK USING MULTIPLE FEATURES AND PCA FOR SPEAKER RECOGNITION IN SHORT SPEECH CONDITION

Cited by: 0
Authors
Zhang, Chi [1 ]
Li, Xiaoqiang [1 ]
Li, Wei [2 ,3 ]
Lu, Peizhong [2 ]
Zhang, Wenqiang [2 ,3 ]
Affiliations
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
[2] Fudan Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
[3] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China
Keywords
speaker recognition; short speech condition; PCA; i-vector; joint factor analysis
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Speaker recognition in the short speech condition is difficult because both the training and the test utterances are very short. One of the main disadvantages of existing speaker recognition methods is that they need a large amount of data, which is usually unavailable in real applications. In our experiments, conventional methods that use a single feature do not perform well on short speech. To overcome this difficulty, we propose a novel i-vector framework that uses multiple features and Principal Component Analysis (PCA) in the short speech condition, since a combination of multiple features can represent more aspects of a speaker. PCA is used to map the multiple features onto an uncorrelated, orthogonal basis set to meet the requirements of the Gaussian Mixture Model (GMM) with diagonal covariance matrices and of the i-vector. The proposed approach yields a relative improvement of roughly 50% in equal error rate over a state-of-the-art system when evaluated on the telephone conditions of the 2010 NIST Speaker Recognition Evaluation (SRE).
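The PCA step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `pca_decorrelate`, the synthetic feature streams, and their dimensions are assumptions made only for illustration. The sketch stacks two correlated per-frame feature streams, projects them onto the eigenvectors of their covariance matrix, and checks that the projected dimensions are uncorrelated, which is the property a diagonal-covariance GMM relies on.

```python
import numpy as np


def pca_decorrelate(frames, n_components=None):
    """Project frames (n_frames x dim) onto the principal axes of their covariance.

    Hypothetical helper for illustration; returns the projected frames,
    the orthonormal basis (dim x k), and the mean that was subtracted.
    """
    mean = frames.mean(axis=0)
    centered = frames - mean
    cov = np.cov(centered, rowvar=False)
    # eigh is appropriate for symmetric matrices; eigenvalues come back ascending.
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1]  # largest variance first
    if n_components is not None:
        order = order[:n_components]
    basis = eigvecs[:, order]
    return centered @ basis, basis, mean


# Synthetic stand-ins for two feature types computed on the same 500 frames
# (assumed shapes; real systems would use acoustic features per frame).
rng = np.random.default_rng(0)
feat_a = rng.normal(size=(500, 4))
feat_b = 0.8 * feat_a[:, :3] + 0.2 * rng.normal(size=(500, 3))  # correlated with feat_a

stacked = np.hstack([feat_a, feat_b])  # frame-wise concatenation, 500 x 7
projected, basis, mean = pca_decorrelate(stacked)

# After projection, the covariance matrix should be (numerically) diagonal.
proj_cov = np.cov(projected, rowvar=False)
off_diag = proj_cov - np.diag(np.diag(proj_cov))
print("largest off-diagonal covariance:", np.abs(off_diag).max())
```

Passing `n_components` would keep only the leading directions; whether and how far the paper reduces dimensionality is not stated in the abstract, so the sketch keeps all of them by default.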
Pages: 499-503
Page count: 5
Related papers
50 in total
  • [1] I-Vector Extraction Using Speaker Relevancy for Short Duration Speaker Recognition
    Kang, Woo Hyun
    Cho, Won Ik
    Jang, Se Young
    Lee, Hyeon Seung
    Kim, Nam Soo
    IT CONVERGENCE AND SECURITY 2017, VOL 1, 2018, 449 : 79 - 87
  • [2] i-vector Based Speaker Recognition on Short Utterances
    Kanagasundaram, Ahilan
    Vogt, Robbie
    Dean, David
    Sridharan, Sridha
    Mason, Michael
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2352 - +
  • [3] Maximum Likelihood i-vector Space Using PCA for Speaker Verification
    Lei, Zhenchun
    Yang, Yingchun
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2736 - 2739
  • [4] An i-vector Extractor Suitable for Speaker Recognition with both Microphone and Telephone Speech
    Senoussaoui, Mohammed
    Kenny, Patrick
    Dehak, Najim
    Dumouchel, Pierre
    ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 28 - 33
  • [5] Speech recognition in reverberant and noisy environments employing multiple feature extractors and i-vector speaker adaptation
    Md Jahangir Alam
    Vishwa Gupta
    Patrick Kenny
    Pierre Dumouchel
    EURASIP Journal on Advances in Signal Processing, 2015
  • [6] Speech recognition in reverberant and noisy environments employing multiple feature extractors and i-vector speaker adaptation
    Alam, Md Jahangir
    Gupta, Vishwa
    Kenny, Patrick
    Dumouchel, Pierre
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015, : 1 - 13
  • [7] Speaker Adaptation Using the I-Vector Technique for Bottleneck Features
    Cardinal, Patrick
    Dehak, Najim
    Zhang, Yu
    Glass, James
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2867 - 2871
  • [8] I-vector Based Speaker Gender Recognition
    Wang, Minghe
    Chen, Ying
    Tang, Zhenmin
    Zhang, Erhua
    2015 IEEE ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2015, : 729 - 732
  • [9] Emotional speaker recognition in real life conditions using multiple descriptors and i-vector speaker modeling technique
    Mansour, Asma
    Chenchah, Farah
    Lachiri, Zied
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (06) : 6441 - 6458
  • [10] Emotional speaker recognition in real life conditions using multiple descriptors and i-vector speaker modeling technique
    Asma Mansour
    Farah Chenchah
    Zied Lachiri
    Multimedia Tools and Applications, 2019, 78 : 6441 - 6458