Cosine Metric Learning for Speaker Verification in the i-Vector Space

Citations: 0
Authors
Bai, Zhong [1]
Zhang, Xiao-Lei
Chen, Jingdong
Affiliation
[1] Northwestern Polytech Univ, Ctr Intelligent Acoust & Immers Commun, Fremont, CA 94539 USA
Funding
National Natural Science Foundation of China;
Keywords
speaker verification; cosine metric learning; channel and session compensation;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
It is known that the equal-error-rate (EER) performance of a speaker verification system is determined by the overlap region of the decision scores of true and imposter trials. In addition, the cosine similarity scores of the true and imposter trials produced by the state-of-the-art i-vector front-end each approximately follow a Gaussian distribution, and the overlap region of the two classes of trials depends mainly on their between-class distance. Motivated by these facts, this paper presents a cosine metric learning (CML) framework for speaker verification, which combines classical compensation techniques with cosine similarity scoring to improve the EER performance. CML minimizes the overlap region by enlarging the between-class distance while introducing a regularization term to control the within-class variance; the framework is initialized by a traditional channel compensation technique such as linear discriminant analysis. Experiments compare the proposed CML framework with several traditional channel compensation baselines on the NIST speaker recognition evaluation data sets. The results show that CML outperforms all the studied initialization compensation techniques.
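The reasoning in the abstract can be illustrated with a minimal sketch. It assumes that the true and imposter scores are exactly Gaussian, in which case the EER has the closed form Φ(-(μ_true - μ_imp) / (σ_true + σ_imp)), where Φ is the standard normal CDF. The function names `cosine_score` and `gaussian_eer` and the numbers in the usage lines are hypothetical and are not taken from the paper; the sketch shows only why a larger between-class distance and a smaller within-class spread lower the EER, not the CML training procedure itself.

```python
import math


def cosine_score(x, y):
    """Cosine similarity between two vectors given as sequences of floats."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)


def gaussian_eer(mu_true, sigma_true, mu_imp, sigma_imp):
    """EER when both score classes are modeled as Gaussians (an assumption).

    At the EER threshold the miss rate equals the false-alarm rate, which
    gives EER = Phi(-(mu_true - mu_imp) / (sigma_true + sigma_imp)).
    """
    z = (mu_true - mu_imp) / (sigma_true + sigma_imp)
    # Standard normal CDF evaluated at -z, written with the error function.
    return 0.5 * (1.0 + math.erf(-z / math.sqrt(2.0)))


# Illustrative numbers only: same spread, larger gap between the class means.
eer_small_gap = gaussian_eer(0.4, 0.1, 0.2, 0.1)
eer_large_gap = gaussian_eer(0.6, 0.1, 0.2, 0.1)
# Same gap as the first case, smaller within-class spread.
eer_tight = gaussian_eer(0.4, 0.05, 0.2, 0.05)
```

Under this model, both widening the gap between the class means and shrinking the standard deviations reduce the EER, which matches the two terms the abstract attributes to the CML objective.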
Pages: 1126-1130
Page count: 5