INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION

Cited by: 0
Authors
Madikeri, Srikanth [1]
Ferras, Marc [1]
Motlicek, Petr [1]
Dey, Subhadeep [1, 2]
Affiliations
[1] Idiap Res Inst, Martigny, Switzerland
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
Keywords
i-vectors; PLDA; multi-session training; MULTI-SESSION; RECOGNITION;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
Multi-session training conditions are becoming increasingly common in recent benchmark datasets for both text-independent and text-dependent speaker verification. In the state-of-the-art i-vector framework for speaker verification, such conditions are addressed by simple techniques such as averaging the individual i-vectors, averaging scores, or modifying the Probabilistic Linear Discriminant Analysis (PLDA) scoring hypothesis for multi-session enrollment. The aforementioned techniques fail to exploit the speaker variabilities observed in the enrollment data for target speakers. In this paper, we propose to exploit the multi-session training data by estimating a speaker-dependent covariance matrix and updating the intra-speaker covariance during PLDA scoring for each target speaker. The proposed method is further extended by combining covariance adaptation and score averaging. In this method, the individual examples of the target speaker are compared against the test data as opposed to an averaged i-vector, and the scores obtained are then averaged. The proposed methods are evaluated on the NIST SRE 2012 dataset. Relative improvements of up to 29% in equal error rate are obtained.
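The abstract gives no formulas, so the following is only a minimal sketch of how covariance adaptation combined with score averaging might look, not the authors' implementation. It assumes a two-covariance PLDA model (global mean `mu`, between-speaker covariance `B`, within-speaker covariance `W`), a maximum-likelihood estimate of the speaker-dependent covariance from the enrollment i-vectors, and a linear interpolation weight `alpha`. All function names and the parameter `alpha` are hypothetical.

```python
import numpy as np


def gaussian_logpdf(z, mean, cov):
    """Log-density of a multivariate normal, computed via a Cholesky factor."""
    diff = z - mean
    chol = np.linalg.cholesky(cov)
    sol = np.linalg.solve(chol, diff)  # sol @ sol == diff^T cov^{-1} diff
    log_det = 2.0 * np.sum(np.log(np.diag(chol)))
    return -0.5 * (sol @ sol + log_det + diff.size * np.log(2.0 * np.pi))


def plda_llr(enroll, test, mu, B, W):
    """Two-covariance PLDA log-likelihood ratio for one enrollment/test pair."""
    total = B + W
    zeros = np.zeros_like(B)
    z = np.concatenate([enroll, test])
    mean = np.concatenate([mu, mu])
    # Same speaker: the two i-vectors share the speaker variable (cross term B).
    same = np.block([[total, B], [B, total]])
    # Different speakers: the two i-vectors are independent.
    diff = np.block([[total, zeros], [zeros, total]])
    return gaussian_logpdf(z, mean, same) - gaussian_logpdf(z, mean, diff)


def adapt_within_cov(W, enroll_ivecs, alpha=0.5):
    """Interpolate W with the speaker's own covariance (assumed update rule)."""
    centred = enroll_ivecs - enroll_ivecs.mean(axis=0)
    S_spk = centred.T @ centred / len(enroll_ivecs)
    return (1.0 - alpha) * W + alpha * S_spk


def score_adapted_averaged(enroll_ivecs, test, mu, B, W, alpha=0.5):
    """Score each enrollment i-vector with the adapted W, then average."""
    W_spk = adapt_within_cov(W, enroll_ivecs, alpha)
    scores = [plda_llr(x, test, mu, B, W_spk) for x in enroll_ivecs]
    return float(np.mean(scores))
```

In this sketch, keeping `alpha` strictly below 1 leaves the adapted covariance positive definite even when the speaker has fewer enrollment sessions than i-vector dimensions, because the global `W` still contributes a full-rank term.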
Pages: 5365 - 5369
Number of pages: 5
Related papers
33 in total
  • [1] Unifying Cosine and PLDA Back-ends for Speaker Verification
    Peng, Zhiyuan
    He, Xuanji
    Ding, Ke
    Lee, Tan
    Wan, Guanglu
    [J]. INTERSPEECH 2022, 2022, : 336 - 340
  • [2] Unsupervised Bayesian Adaptation of PLDA for Speaker Verification
    Borgstrom, Bengt J.
    [J]. INTERSPEECH 2021, 2021, : 1039 - 1043
  • [3] Duration Dependent Covariance Regularization in PLDA Modeling for Speaker Verification
    Cai, Weicheng
    Li, Ming
    Li, Lin
    Hong, Qingyang
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1027 - 1031
  • [4] When Speaker Recognition Meets Noisy Labels: Optimizations for Front-Ends and Back-Ends
    Li, Lin
    Tong, Fuchuan
    Hong, Qingyang
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1586 - 1599
  • [5] Unsupervised Discriminative Training of PLDA for Domain Adaptation in Speaker Verification
    Wang, Qiongqiong
    Koshinaka, Takafumi
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3727 - 3731
  • [6] CHANNEL ADAPTATION OF PLDA FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
    Chen, Liping
    Lee, Kong Aik
    Ma, Bin
    Guo, Wu
    Li, Haizhou
    Dai, Li Rong
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5251 - 5255
  • [7] EFFECTS OF INTRA-CLASS CORRELATION ON COVARIANCE ANALYSIS
    SMITH, JH
    LEWIS, TO
    [J]. COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1982, 11 (01): : 71 - 80
  • [8] MULTIVARIATE MODEL WITH INTRA-CLASS COVARIANCE STRUCTURE
    HAQ, MS
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1974, 26 (03) : 413 - 420
  • [9] A CHARACTERIZATION OF A NORMAL INTRA-CLASS COVARIANCE-MATRIX
    ROGERS, GS
    [J]. SOUTH AFRICAN STATISTICAL JOURNAL, 1980, 14 (01) : 43 - 45
  • [10] Dataset-Invariant Covariance Normalization for Out-domain PLDA Speaker Verification
    Rahman, Md Hafizur
    Kanagasundaram, Ahilan
    Dean, David
    Sridharan, Sridha
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1017 - 1021