Analysis of I-vector Length Normalization in Speaker Recognition Systems

被引:0
|
作者
Garcia-Romero, Daniel [1 ]
Espy-Wilson, Carol Y. [1 ]
机构
[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
关键词
speaker recognition; i-vectors; length normalization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a method to boost the performance of probabilistic generative models that work with i-vector representations. The proposed approach deals with the non-Gaussian behavior of i-vectors by performing a simple length normalization. This non-linear transformation allows the use of probabilistic models with Gaussian assumptions that yield equivalent performance to that of more complicated systems based on Heavy-Tailed assumptions. Significant performance improvements are demonstrated on the telephone portion of NIST SEE 2010.
引用
收藏
页码:256 / 259
页数:4
相关论文
共 50 条
  • [1] Evaluation of i-vector Speaker Recognition Systems for Forensic Application
    Mandasari, Miranti Indar
    McLaren, Mitchell
    van Leeuwen, David
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 28 - 31
  • [2] I-vector Based Speaker Gender Recognition
    Wang, Minghe
    Chen, Ying
    Tang, Zhenmin
    Zhang, Erhua
    [J]. 2015 IEEE ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2015, : 729 - 732
  • [3] DURATION MISMATCH COMPENSATION FOR I-VECTOR BASED SPEAKER RECOGNITION SYSTEMS
    Hasan, Taufiq
    Saeidi, Rahim
    Hansen, John H. L.
    van Leeuwen, David A.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7663 - 7667
  • [4] Applying Emotional Factor Analysis and I-Vector to Emotional Speaker Recognition
    Chen, Li
    Yang, Yingchun
    [J]. BIOMETRIC RECOGNITION: CCBR 2011, 2011, 7098 : 174 - 179
  • [5] i-vector Based Speaker Recognition on Short Utterances
    Kanagasundaram, Ahilan
    Vogt, Robbie
    Dean, David
    Sridharan, Sridha
    Mason, Michael
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2352 - +
  • [6] Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition
    Wang, Shuai
    Huang, Zili
    Qian, Yanmin
    Yu, Kai
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 195 - 199
  • [7] I-Vector Speaker and Language Recognition System on Android
    Vazquez-Machado, Christian
    Colon-Hernandez, Pedro
    Torres-Carrasquillo, Pedro A.
    [J]. 2016 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2016,
  • [8] Generalized cosine similarity in I-vector based automatic speaker recognition systems
    Drgas, Szymon
    Dabrowski, Adam
    [J]. 2013 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2013, : 73 - 77
  • [9] Generalizing I-Vector Estimation for Rapid Speaker Recognition
    Xu, Longting
    Lee, Kong Aik
    Li, Haizhou
    Yang, Zhen
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 749 - 759
  • [10] DEALING WITH ADDITIVE NOISE IN SPEAKER RECOGNITION SYSTEMS BASED ON I-VECTOR APPROACH
    Matrouf, D.
    Ben Kheder, W.
    Bousquet, P-M.
    Ajili, M.
    Bonastre, J-F.
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2092 - 2096