NORMALIZATION OF TOTAL VARIABILITY MATRIX FOR I-VECTOR/PLDA SPEAKER VERIFICATION

被引:0
|
作者
Rao, Wei [1 ]
Mak, Man-Wai [1 ]
Lee, Kong-Aik [2 ]
机构
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China
[2] ASTAR, Inst Infocomm Res, Human Language Technol Dept, Singapore, Singapore
关键词
Total variability matrix; i-vectors; probabilistic linear discriminant analysis; uncertainty propagation; speaker verification;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Gaussian PLDA with uncertainty propagation is effective for i-vector based speaker verification. The idea is to propagate the uncertainty of i-vectors caused by the duration variability of utterances to the PLDA model. However, a limitation of the method is the difficulty of performing length normalization on the posterior covariance matrix of an i-vector. This paper proposes a method to avoid performing length normalization on i-vectors in Gaussian PLDA modeling so that uncertainty propagation can be directly applied without transforming the posterior covariance matrices of i-vectors. Instead of performing length normalization on i-vectors independently, the proposed method normalizes the column vectors of the total variability matrix. Because the i-vectors of all utterances are derived from the same normalized total variability matrix, they will be subject to the same degree of normalization, thereby avoiding the undesirable distortion introduced by the utterance-dependent length normalization process. Experimental results on both NIST 2010 and 2012 SREs demonstrate that the proposed method achieves a performance similar to (and in some situations better than) that of Gaussian PLDA with length normalization. The method has the potential of improving the performance of uncertainty propagation for i-vector/PLDA speaker verification.
引用
收藏
页码:4180 / 4184
页数:5
相关论文
共 50 条
  • [41] Minimax i-vector extractor for short duration speaker verification
    Hautamaki, Ville
    Cheng, You-Chi
    Rajan, Padmanabhan
    Lee, Chin-Hui
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3675 - 3679
  • [42] Speaker normalization through feature shifting of linearly transformed i-vector
    Goo, Jahyun
    Kim, Younggwan
    Lim, Hyungjun
    Kim, Hoirin
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3489 - 3493
  • [43] A Comparison of Covariance Matrix and i-vector Based Speaker Recognition
    Jakovljevic, Niksa
    Jokic, Ivan
    Josic, Slobodan
    Delic, Vlado
    [J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 37 - 45
  • [44] Effect of long-term ageing on i-vector speaker verification
    Kelly, Finnian
    Saeidi, Rahim
    Harte, Naomi
    van Leeuwen, David
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 86 - 90
  • [45] Single-sided Approach to Discriminative PLDA Training for Text-Independent Speaker Verification without Using Expanded I-vector
    Hirano, Ikuya
    Lee, Kong Aik
    Zhang, Zhaofeng
    Wang, Longbiao
    Kai, Atsuhiko
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 59 - +
  • [46] Discriminant Analysis Methods Comparison in I-Vector Space for Speaker Verification
    Mohammadi, Mohsen
    Mohammadi, Hamid Reza Sadegh
    [J]. 2018 9TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2018, : 166 - 172
  • [47] Deep Nonlinear Metric Learning for Speaker Verification in the I-Vector Space
    Feng, Yong
    Xiong, Qingyu
    Shi, Weiren
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (01): : 215 - 219
  • [48] Neural Networks based Channel Compensation for I-Vector Speaker Verification
    Rao, Wei
    Xiao, Xiong
    Xu, Chenglin
    Xu, Haihua
    Lee, Kong Aik
    Chng, Eng Siong
    Li, Haizhou
    [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [49] Best Feature Selection for Emotional Speaker Verification in i-vector Representation
    Mackova, Lenka
    Cizmar, Anton
    Juhar, Jozef
    [J]. 2015 25TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2015, : 209 - 212
  • [50] I-Vector DNN Scoring and Calibration for Noise Robust Speaker Verification
    Tan, Zhili
    Mak, Man-Wai
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1562 - 1566