INTRA-CLASS COVARIANCE ADAPTATION IN PLDA BACK-ENDS FOR SPEAKER VERIFICATION

被引:0
|
作者
Madikeri, Srikanth [1 ]
Ferras, Marc [1 ]
Motlicek, Petr [1 ]
Dey, Subhadeep [1 ,2 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
i-vectors; PLDA; multi-session training; MULTI-SESSION; RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Multi-session training conditions are becoming increasingly common in recent benchmark datasets for both text-independent and text-dependent speaker verification. In the state-of-the-art i-vector framework for speaker verification, such conditions are addressed by simple techniques such as averaging the individual i-vectors, averaging scores, or modifying the Probabilistic Linear Discriminant Analysis (PLDA) scoring hypothesis for multi-session enrollment. The aforementioned techniques fail to exploit the speaker variabilities observed in the enrollment data for target speakers. In this paper, we propose to exploit the multi-session training data by estimating a speaker-dependent covariance matrix and updating the intra-speaker covariance during PLDA scoring for each target speaker. The proposed method is further extended by combining covariance adaptation and score averaging. In this method, the individual examples of the target speaker are compared against the test data as opposed to an averaged i-vector, and the scores obtained are then averaged. The proposed methods are evaluated on the NIST SRE 2012 dataset. Relative improvements of up to 29% in equal error rate are obtained.
引用
收藏
页码:5365 / 5369
页数:5
相关论文
共 33 条
  • [31] Class-Aware Distribution Alignment based Unsupervised Domain Adaptation for Speaker Verification
    Hu, Hang-Rui
    Song, Yan
    Dai, Li-Rong
    McLoughlin, Ian
    Liu, Lin
    [J]. INTERSPEECH 2022, 2022, : 3689 - 3693
  • [32] Few-Shot Classification With Intra-Class Unrelated Multi-Prototype Representation and Episode Adaptation Strategy
    Wen Zhijie
    Zhang Qi
    Wang Wei
    Ma Liyan
    [J]. 2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 1042 - 1049
  • [33] DOMAIN ADAPTATION VIA WITHIN-CLASS COVARIANCE CORRECTION IN I-VECTOR BASED SPEAKER RECOGNITION SYSTEMS
    Glembek, Ondrej
    Ma, Jeff
    Matejka, Pavel
    Zhang, Bing
    Plchot, Oldrich
    Burget, Lukas
    Matsoukas, Spyros
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,