Bayesian Distance Metric Learning on i-vector for Speaker Verification

Cited by: 0
Authors
Fang, Xiao [1]
Dehak, Najim [1]
Glass, James [1]
Affiliations
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
Keywords
i-vector; score normalization; distance metric learning; channel compensation; limited training utterances;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a new speaker verification system that uses i-vector modeling as a feature extractor. Within this framework, we explore distance constraints between pairs of i-vectors from the same speaker and from different speakers. The distance metric is approximated as a weighted covariance matrix of the top eigenvectors of the data covariance matrix, and variational inference is used to estimate a posterior distribution over the metric. Given speaker labels, we select the different-speaker pairs with the highest cosine scores to form a different-speaker constraint set, which captures the most discriminative between-speaker variability in the training data. This Bayesian distance metric learning approach outperforms the state-of-the-art method. Furthermore, unlike cosine scoring, it is insensitive to score normalization. With no requirement on the number of labeled examples, the approach performs very well when training data is limited.
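The pair-selection step described in the abstract (choosing the different-speaker pairs with the highest cosine scores) might be sketched as follows. This is only an illustration under stated assumptions: the function name `hardest_different_speaker_pairs`, the `num_pairs` parameter, and the use of NumPy are hypothetical and not taken from the paper, and the abstract does not say how many pairs the authors keep or how they handle ties.

```python
import numpy as np


def hardest_different_speaker_pairs(ivectors, labels, num_pairs):
    """Return up to num_pairs different-speaker pairs with the highest cosine scores.

    Hypothetical helper: each result is (index_i, index_j, cosine_score).
    """
    x = np.asarray(ivectors, dtype=float)
    labels = np.asarray(labels)

    # Length-normalize so that a dot product equals the cosine score.
    x = x / np.linalg.norm(x, axis=1, keepdims=True)
    scores = x @ x.T

    # Visit each unordered pair once (i < j), keeping different-speaker pairs only.
    i_idx, j_idx = np.triu_indices(len(labels), k=1)
    different = labels[i_idx] != labels[j_idx]
    i_idx, j_idx = i_idx[different], j_idx[different]
    pair_scores = scores[i_idx, j_idx]

    # Sort by descending cosine score and keep the top num_pairs.
    order = np.argsort(-pair_scores, kind="stable")[:num_pairs]
    return [(int(i_idx[k]), int(j_idx[k]), float(pair_scores[k])) for k in order]


if __name__ == "__main__":
    # Toy 2-D "i-vectors" for two speakers, purely for illustration.
    toy_ivectors = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.2, 0.8]]
    toy_labels = ["a", "b", "a", "b"]
    for i, j, score in hardest_different_speaker_pairs(toy_ivectors, toy_labels, 2):
        print(i, j, round(score, 3))
```

The selected pairs are the different-speaker examples that cosine scoring finds hardest to separate, which is one reading of why the abstract calls this set the most discriminative. The full pairwise score matrix used here is quadratic in the number of i-vectors, so a large training set would need a blocked or approximate variant.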
Pages: 2513-2517
Number of pages: 5
Related papers
50 in total
  • [1] Cosine Metric Learning for Speaker Verification in the i-Vector Space
    Bai, Zhong
    Zhang, Xiao-Lei
    Chen, Jingdong
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1126 - 1130
  • [2] Deep Nonlinear Metric Learning for Speaker Verification in the I-Vector Space
    Feng, Yong
    Xiong, Qingyu
    Shi, Weiren
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (01): : 215 - 219
  • [3] Bayesian Principal Component Analysis for I-Vector Speaker Verification
    Rong Y.-F.
    Chen C.
    Chen D.-Y.
    He Y.-J.
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (11): : 2186 - 2194
  • [4] Large Margin Nearest Neighborhood Metric Learning for I-Vector Based Speaker Verification
    Ahmad, Waquar
    Karnick, Harish
    Hegde, Rajesh M.
    [J]. CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 827 - 832
  • [5] An I-Vector Backend for Speaker Verification
    Kenny, Patrick
    Stafylakis, Themos
    Alam, Jahangir
    Kockmann, Marcel
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2307 - 2311
  • [6] Pairwise Discriminative Speaker Verification in the I-Vector Space
    Cumani, Sandro
    Bruemmer, Niko
    Burget, Lukas
    Laface, Pietro
    Plchot, Oldrich
    Vasilakakis, Vasileios
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (06): : 1217 - 1227
  • [7] Feature Switching in the i-vector Framework for Speaker Verification
    Asha, T.
    Saranya, M. S.
    Pandia, Karthik D. S.
    Madikeri, Srikanth
    Murthy, Hema A.
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1125 - 1129
  • [8] Joint Speaker Verification and Antispoofing in the i-Vector Space
    Sizov, Aleksandr
    Khoury, Elie
    Kinnunen, Tomi
    Wu, Zhizheng
    Marcel, Sebastien
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2015, 10 (04) : 821 - 832
  • [9] i-Vector with sparse representation classification for speaker verification
    Kua, Jia Min Karen
    Epps, Julien
    Ambikairajah, Eliathamby
    [J]. SPEECH COMMUNICATION, 2013, 55 (05) : 707 - 720
  • [10] FAST DISCRIMINATIVE SPEAKER VERIFICATION IN THE I-VECTOR SPACE
    Cumani, Sandro
    Bruemmer, Niko
    Burget, Lukas
    Laface, Pietro
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4852 - 4855