I-VECTOR KULLBACK-LEIBLER DIVISIVE NORMALIZATION FOR PLDA SPEAKER VERIFICATION

被引:0
|
作者
Pan, Yilin [1 ]
Zheng, Tieran [1 ]
Chen, Chen [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
speaker verification; PLDA; Gaussianization; divisive normalization; Kullback-Leibler divergence; CORTEX;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
I-vector and Probabilistic Linear Discriminant Analysis (PLDA) represents the state-of-the-art in the speaker verification system. In PLDA, the i-vectors are assumed to follow Gaussian distribution. However, this assumption results in poor modeling without Gaussianization. Different from previous Gaussianization methods, in our proposed method, we make no restriction towards the original distribution of i-vectors for flexibility and universality. To optimize the Gaussian transformation function, Kullback-Leibler divergence (KLD) is introduced to measure the distance between the two distributions. By minimizing the KLD value under the development data, we can search out the optimal parameters in transformation function. The proposed method shows significant improvement on NIST SRE 2008 core set; together with length normalization (LN), a famous Gaussianization method, can further improve the verification accuracy.
引用
收藏
页码:56 / 60
页数:5
相关论文
共 50 条
  • [1] NORMALIZATION OF TOTAL VARIABILITY MATRIX FOR I-VECTOR/PLDA SPEAKER VERIFICATION
    Rao, Wei
    Mak, Man-Wai
    Lee, Kong-Aik
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4180 - 4184
  • [2] Fast Scoring for Mixture of PLDA in I-Vector/PLDA Speaker Verification
    Mak, Man-Wai
    [J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 587 - 593
  • [3] PLDA Modeling in I-Vector and Supervector Space for Speaker Verification
    Jiang, Ye
    Lee, Kong Aik
    Tang, Zhenmin
    Ma, Bin
    Larcher, Anthony
    Li, Haizhou
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1678 - 1681
  • [4] Non-linear PLDA for i-Vector Speaker Verification
    Novoselov, Sergey
    Pekhovsky, Timur
    Kudashev, Oleg
    Mendelev, Valentin
    Prudnikov, Alexey
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 214 - 218
  • [5] Nonparametrically trained PLDA for short duration i-vector speaker verification
    Khosravani, Abbas
    Homayounpour, Mohammad M.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2018, 52 : 105 - 122
  • [6] A Monte-Carlo method for score normalization in automatic speaker verification using Kullback-Leibler distances
    Ben, M
    Blouet, R
    Bimbot, F
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 689 - 692
  • [7] DEEP NEURAL NETWORK DRIVEN MIXTURE OF PLDA FOR ROBUST I-VECTOR SPEAKER VERIFICATION
    Li, Na
    Mak, Man-Wai
    Chien, Jen-Tzung
    [J]. 2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 186 - 191
  • [8] DEEP NEURAL NETWORK BASED DISCRIMINATIVE TRAINING FOR I-VECTOR/PLDA SPEAKER VERIFICATION
    Zheng Tieran
    Han Jiqing
    Zheng Guibin
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5354 - 5358
  • [9] Turkish Text-Dependent Speaker Verification using i-vector/PLDA Approach
    Hanilci, Cemal
    Celiktas, Havva
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [10] Feature selection for fusion of speaker verification via Maximum Kullback-Leibler Distance
    Liu, Di
    Sun, Dong-Mei
    Qiu, Zheng-Ding
    [J]. 2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 565 - 568