Improving X-vector and PLDA for Text-dependent Speaker Verification

Cited by: 5
Authors
Chen, Zhuxin [1]
Lin, Yue [1]
Affiliations
[1] NetEase Games AI Lab, Hangzhou, People's Republic of China
Source
INTERSPEECH 2020
Keywords
speaker verification; x-vector; PLDA; SDSVC 2020; short utterance
DOI
10.21437/Interspeech.2020-1188
Chinese Library Classification
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification codes
100104; 100213
Abstract
Recently, the pipeline consisting of an x-vector speaker embedding front-end and a Probabilistic Linear Discriminant Analysis (PLDA) back-end has achieved state-of-the-art results in text-independent speaker verification. In this paper, we further improve the performance of x-vector and PLDA based systems for text-dependent speaker verification by exploring the choice of layer used to produce the embedding and by modifying the back-end training strategies. In particular, we show that x-vector based embeddings, specifically the standard deviation statistics in the pooling layer, contain information related to both speaker characteristics and spoken content. Accordingly, we modify the back-end training labels to use both the speaker-id and the phrase-id. A correlation-alignment-based PLDA adaptation is also adopted to make use of text-independent labeled data during back-end training. Experimental results on the SDSVC 2020 dataset show that our proposed methods achieve significant performance improvements over the x-vector and HMM based i-vector baselines.
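The two back-end ideas named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `joint_labels` and `coral_align`, the label format, and the `reg` parameter are all hypothetical. The alignment shown is plain feature-level correlation alignment (whiten with the source covariance, re-colour with the target covariance), which may differ from the model-level PLDA adaptation used in the paper.

```python
import numpy as np


def joint_labels(speaker_ids, phrase_ids):
    """Treat every (speaker, phrase) pair as its own back-end training class.

    Assumption: labels are joined with an underscore; any unique pairing works.
    """
    return [f"{spk}_{phr}" for spk, phr in zip(speaker_ids, phrase_ids)]


def _sym_power(mat, power, eps=1e-6):
    """Raise a symmetric positive semi-definite matrix to a real power."""
    vals, vecs = np.linalg.eigh(mat)
    vals = np.maximum(vals, eps)  # guard against tiny or negative eigenvalues
    return (vecs * vals ** power) @ vecs.T


def coral_align(source, target, reg=1e-3):
    """Feature-level correlation alignment of source embeddings to a target domain.

    source: (n_source, dim) embeddings, e.g. from text-independent data.
    target: (n_target, dim) embeddings, e.g. from text-dependent data.
    reg:    hypothetical ridge term that keeps the covariances invertible.
    """
    dim = source.shape[1]
    cov_s = np.cov(source, rowvar=False) + reg * np.eye(dim)
    cov_t = np.cov(target, rowvar=False) + reg * np.eye(dim)
    centred = source - source.mean(axis=0)
    # Whiten with the source covariance, then re-colour with the target one.
    aligned = centred @ _sym_power(cov_s, -0.5) @ _sym_power(cov_t, 0.5)
    return aligned + target.mean(axis=0)
```

With these helpers, the aligned out-of-domain embeddings could be pooled with the in-domain ones before training a PLDA model on the joint speaker-phrase classes. Whether pooling or a weighted combination works better is an empirical question that the sketch does not address.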
Pages: 726-730
Number of pages: 5