PLDA-based Speaker Verification in Multi-Enrollment Scenario using Expected Vector Approach

Cited by: 0
Authors
Soni, Meet [1]
Panda, Ashish [1]
Affiliation
[1] Tata Consultancy Serv, Mumbai, Maharashtra, India
Keywords
Speaker Verification; Multi-session scoring; Multi-enrollment scoring; Expected Vector; END;
DOI
10.1109/ISCSLP49672.2021.9362113
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The multi-enrollment scoring scenario, in which multiple utterances are available for an enrollment speaker, is one of the less explored problems in the Probabilistic Linear Discriminant Analysis (PLDA) scoring literature. Because the closed-form PLDA scoring formula for the multi-enrollment scenario is impractical, alternative heuristic approaches are widely used in both i-vector and x-vector based speaker verification systems. In this paper, we describe an Expected Vector approach for obtaining a single vector from multiple enrollment utterances. The Expected Vector approach uses a trained PLDA model to compute the expected class center given a set of vectors under that model, which yields a more meaningful class center representation. This vector can then be used to score a trial with the two-vector scoring formula of the given PLDA model. We compare the proposed approach with various heuristic approaches and show that it provides significant improvements in Equal Error Rate (EER) and minimum Detection Cost Function (minDCF). We report results for an x-vector system trained on the VoxCeleb dataset, with various implementations of PLDA and trials designed on the VoxCeleb and LibriSpeech datasets.
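The idea of an expected class center can be illustrated with a minimal sketch. It assumes a simplified, diagonalized PLDA model that is not specified in the abstract: embeddings are taken to be already transformed so that the within-speaker covariance is the identity and the between-speaker covariance is diagonal with variances `psi`. Under that assumption, the posterior mean of the speaker's latent center given the enrollment vectors has a closed form. The function name `expected_class_center` and its parameters are hypothetical and chosen for illustration only; the paper's exact formulation may differ.

```python
from typing import List, Optional, Sequence


def expected_class_center(
    vectors: Sequence[Sequence[float]],
    psi: Sequence[float],
    mu: Optional[Sequence[float]] = None,
) -> List[float]:
    """Posterior mean of the class center under an assumed diagonal PLDA model.

    Assumed model (per dimension d), not taken from the paper:
        center_d ~ N(mu_d, psi_d)          between-speaker prior
        x_d | center_d ~ N(center_d, 1)    within-speaker noise
    """
    n = len(vectors)
    if n == 0:
        raise ValueError("at least one enrollment vector is required")
    dim = len(psi)
    if mu is None:
        mu = [0.0] * dim  # assume embeddings are mean-centered

    center = []
    for d in range(dim):
        total = sum(v[d] for v in vectors)   # sum of enrollment values
        precision = 1.0 / psi[d] + n         # prior precision + n observations
        center.append((mu[d] / psi[d] + total) / precision)
    return center


# Two enrollment vectors in a 2-dimensional (already transformed) space.
enroll = [[2.0, 4.0], [4.0, 0.0]]
print(expected_class_center(enroll, psi=[1.0, 0.5]))
```

Compared with plain averaging of the enrollment vectors, this estimate shrinks toward the prior mean, and the shrinkage is stronger in dimensions where the between-speaker variance is small or when few enrollment utterances are available.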
Pages: 5
Related papers
50 items in total
  • [31] Deep neural network based i-vector mapping for speaker verification using short utterances
    Guo, Jinxi
    Xu, Ning
    Qian, Kailun
    Shi, Yang
    Xu, Kaiyuan
    Wu, Yingnian
    Alwan, Abeer
    SPEECH COMMUNICATION, 2018, 105 : 92 - 102
  • [32] Automatic Speaker Recognition :An Approach using DWT based Feature Extraction and Vector Quantization
    Singhai, Jyoti
    Singhai, Rakesh
    IETE TECHNICAL REVIEW, 2007, 24 (05) : 395 - 402
  • [33] Automatic speaker recognition : An approach using DWT based feature extraction and vector quantization
    Singhai, Jyoti
    Singhai, Rakesh
    IETE Technical Review (Institution of Electronics and Telecommunication Engineers, India), 2007, 24 (05) : 395 - 402
  • [34] Using the conformal embedding analysis to compensate the channel effect in the i-vector based speaker verification system
    Boulkenafet, Z.
    Bengherabi, M.
    Nouali, O.
    Cheriet, M.
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE OF THE BIOMETRICS SPECIAL INTEREST GROUP (BIOSIG 2013), 2013,
  • [35] MULTI-VIEW (JOINT) PROBABILITY LINEAR DISCRIMINATION ANALYSIS FOR J-VECTOR BASED TEXT DEPENDENT SPEAKER VERIFICATION
    Shi, Ziqiang
    Liu, Liu
    Wang, Mengjiao
    Liu, Rujie
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 614 - 620
  • [36] Multi-Task Learning with High-Order Statistics for X-vector based Text-Independent Speaker Verification
    You, Lanhua
    Guo, Wu
    Dai, Li-Rong
    Du, Jun
    INTERSPEECH 2019, 2019, : 1158 - 1162
  • [37] Robust text-independent speaker verification system using sonant-based approach for mandarin speech
    Deng, HJ
    Du, LM
    Wan, HJ
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IV, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING, 2003, : 377 - 382
  • [38] Multi-SNR GMMs-based noise-robust speaker verification using 1/fα noises
    Yang, Liping
    Gong, Weiguo
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 241 - +
  • [39] WHISPER TO NEUTRAL MAPPING USING I-VECTOR SPACE LIKELIHOOD AND A COSINE SIMILARITY BASED ITERATIVE OPTIMIZATION FOR WHISPERED SPEAKER VERIFICATION
    Naini, Abinay Reddy
    MV, Achuth Rao
    Ghosh, Prasanta Kumar
    2022 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2022, : 130 - 135
  • [40] Speaker identification using multi-modal i-vector approach for varying length speech in voice interactive systems
    Tiwari, Varun
    Hashmi, Mohammad Farukh
    Keskar, Avinash
    Shivaprakash, N. C.
    COGNITIVE SYSTEMS RESEARCH, 2019, 57 : 66 - 77