Single-sided Approach to Discriminative PLDA Training for Text-Independent Speaker Verification without Using Expanded I-vector

被引:0
|
作者
Hirano, Ikuya [1 ]
Lee, Kong Aik [2 ]
Zhang, Zhaofeng [3 ]
Wang, Longbiao [3 ]
Kai, Atsuhiko [1 ]
机构
[1] Shizuoka Univ, Shizuoka, Japan
[2] ASTAR, Inst Infocomm Res, Singapore, Singapore
[3] Nagaoka Univ Technol, Nagaoka, Niigata, Japan
关键词
speaker verification; discriminative training; Probabilistic Linear Discriminant Analysis; CLASSIFICATION; GMM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Probabilistic linear discriminant analysis (PLDA) has shown to be an effective model for disentangling speaker and channel variability in the i-vector space for text-independent speaker verification. The speaker and channel subspaces in the PLDA model are typically trained by optimizing the maximum likelihood (ML) criterion. PLDA assumes that i-vectors are normally distributed, which has shown to be violated in practice. This paper advocates the use of discriminative training, in which both target and non-target classes are taken into account to re-train the parameters. The efficacy of the proposed method is confirmed via experiments conducted on common condition 1 and 5 of the core task as specified in the Speaker Recognition Evaluations (SREs) 2010 conducted by the National Institute for Standards and Technology (NIST).
引用
下载
收藏
页码:59 / +
页数:3
相关论文
共 25 条
  • [1] Turkish Text-Dependent Speaker Verification using i-vector/PLDA Approach
    Hanilci, Cemal
    Celiktas, Havva
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [2] Weighted I-Vector Based Text-Independent Speaker Verification System
    Mohammadi, Mohsen
    Mohammadi, Hamid Reza Sadegh
    2019 27TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2019), 2019, : 1647 - 1653
  • [3] DEEP NEURAL NETWORK BASED DISCRIMINATIVE TRAINING FOR I-VECTOR/PLDA SPEAKER VERIFICATION
    Zheng Tieran
    Han Jiqing
    Zheng Guibin
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5354 - 5358
  • [4] DEEP BOTTLENECK FEATURES FOR I-VECTOR BASED TEXT-INDEPENDENT SPEAKER VERIFICATION
    Ghalehjegh, Sina Hamidi
    Rose, Richard C.
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 555 - 560
  • [5] I-vector Based Text-Independent Speaker Identification
    Liu, Tingting
    Kang, Kai
    Guan, Shengxiao
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 5420 - 5425
  • [6] PLDA in the i-supervector space for text-independent speaker verification
    Ye Jiang
    Kong Aik Lee
    Longbiao Wang
    EURASIP Journal on Audio, Speech, and Music Processing, 2014 (1)
  • [7] PLDA in the i-supervector space for text-independent speaker verification
    Jiang, Ye
    Lee, Kong Aik
    Wang, Longbiao
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014, : 1 - 13
  • [8] A discriminative training approach for text-independent speaker recognition
    Hong, QY
    Kwong, S
    SIGNAL PROCESSING, 2005, 85 (07) : 1449 - 1463
  • [9] Maximum Model Distance Discriminative Training for Text-Independent Speaker Verification
    Hong, Q. Y.
    Kwong, S.
    IECON 2004: 30TH ANNUAL CONFERENCE OF IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOL 2, 2004, : 1769 - 1774
  • [10] JOINT I-VECTOR WITH END-TO-END SYSTEM FOR SHORT DURATION TEXT-INDEPENDENT SPEAKER VERIFICATION
    Huang, Zili
    Wang, Shuai
    Qian, Yanmin
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4869 - 4873