I-VECTOR KULLBACK-LEIBLER DIVISIVE NORMALIZATION FOR PLDA SPEAKER VERIFICATION

被引:0
|
作者
Pan, Yilin [1 ]
Zheng, Tieran [1 ]
Chen, Chen [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
speaker verification; PLDA; Gaussianization; divisive normalization; Kullback-Leibler divergence; CORTEX;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
I-vector and Probabilistic Linear Discriminant Analysis (PLDA) represents the state-of-the-art in the speaker verification system. In PLDA, the i-vectors are assumed to follow Gaussian distribution. However, this assumption results in poor modeling without Gaussianization. Different from previous Gaussianization methods, in our proposed method, we make no restriction towards the original distribution of i-vectors for flexibility and universality. To optimize the Gaussian transformation function, Kullback-Leibler divergence (KLD) is introduced to measure the distance between the two distributions. By minimizing the KLD value under the development data, we can search out the optimal parameters in transformation function. The proposed method shows significant improvement on NIST SRE 2008 core set; together with length normalization (LN), a famous Gaussianization method, can further improve the verification accuracy.
引用
收藏
页码:56 / 60
页数:5
相关论文
共 50 条
  • [31] Mixture of PLDA Models in I-Vector Space for Gender-Independent Speaker Recognition
    Senoussaoui, Mohammed
    Kenny, Patrick
    Bruemmer, Niko
    de Villiers, Edward
    Dumouchel, Pierre
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 32 - +
  • [32] Sparsity Analysis and Compensation for i-Vector Based Speaker Verification
    Li, Wei
    Fu, Tian Fan
    Zhu, Jie
    Chen, Ning
    [J]. SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 381 - 388
  • [33] Linear feature vector compression using Kullback-Leibler distance
    Crysandt, Holger
    [J]. 2006 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2006, : 556 - 561
  • [34] An Adaptive i-Vector Extraction for Speaker Verification with Short Utterance
    Poddar, Arnab
    Sahidullah, Md
    Saha, Goutam
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 326 - 332
  • [35] Cosine Metric Learning for Speaker Verification in the i-Vector Space
    Bai, Zhong
    Zhang, Xiao-Lei
    Chen, Jingdong
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1126 - 1130
  • [36] Feature sparsity analysis for i-vector based speaker verification
    Li, Wei
    Fu, Tianfan
    You, Hanxu
    Zhu, Jie
    Chen, Ning
    [J]. SPEECH COMMUNICATION, 2016, 80 : 60 - 70
  • [37] Geometric Discriminant Analysis for I-vector Based Speaker Verification
    Xu, Can
    Chen, Xianhong
    He, Liang
    Liu, Jia
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1636 - 1640
  • [38] Bayesian Principal Component Analysis for I-Vector Speaker Verification
    Rong, Ya-Feng
    Chen, Chen
    Chen, De-Yun
    He, Yong-Jun
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (11): : 2186 - 2194
  • [39] WEIGHTED LDA TECHNIQUES FOR I-VECTOR BASED SPEAKER VERIFICATION
    Kanagasundaram, A.
    Dean, D.
    Vogt, R.
    McLaren, M.
    Sridharan, S.
    Mason, M.
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4781 - 4784
  • [40] The Inverse Kullback-Leibler Method for Fitting Vector Moving Averages
    McElroy, Tucker
    Roy, Anindya
    [J]. JOURNAL OF TIME SERIES ANALYSIS, 2018, 39 (02) : 172 - 191