Content Normalization for Text-dependent Speaker Verification

被引:7
|
作者
Dey, Subhadeep [1 ,2 ]
Madikeri, Srikanth [1 ]
Motlicek, Petr [1 ]
Ferras, Marc [1 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
speaker verification; i-vectors; content matching;
D O I
10.21437/Interspeech.2017-1419
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Subspace based techniques, such as i-vector and Joint Factor Analysis (JFA) have shown to provide state-of-the-art performance for fixed phrase based text-dependent speaker verification. However, the error rates of such systems on the random digit task of RSR dataset are higher than that of Gaussian Mixture Model-Universal Background Model (GMM-UBM). In this paper, we aim at improving i-vector system by normalizing the content of the enrollment data to match the test data. We estimate i-vectors for each frames of a speech utterance (also called online i-vectors). The largest similarity scores across frames between enrollment and test are taken using these online i-vectors to obtain speaker verification scores. Experiments on Part3 of RSR corpora show that the proposed approach achieves 12% relative improvement in equal error rate over a GMM-UBM based baseline system.
引用
收藏
页码:1482 / 1486
页数:5
相关论文
共 50 条
  • [41] Improving X-vector and PLDA for Text-dependent Speaker Verification
    Chen, Zhuxin
    Lin, Yue
    [J]. INTERSPEECH 2020, 2020, : 726 - 730
  • [42] END-TO-END ATTENTION BASED TEXT-DEPENDENT SPEAKER VERIFICATION
    Zhang, Shi-Xiong
    Chen, Zhuo
    Zhao, Yong
    Li, Jinyu
    Gong, Yifan
    [J]. 2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 171 - 178
  • [43] Lexicon-Based Local Representation for Text-Dependent Speaker Verification
    You, Hanxu
    Li, Wei
    Li, Lianqiang
    Zhu, Jie
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (03): : 587 - 589
  • [44] Text-dependent Speaker Verification Using Word-based Scoring
    Yao, Shengyu
    Huang, Houjun
    Zhou, Ruohua
    Yan, Yonghong
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 314 - 318
  • [45] APPLICATION OF DYNAMIC TIME WARPING AND CEPSTROGRAMS TO TEXT-DEPENDENT SPEAKER VERIFICATION
    Kaczmarek, Andrzej
    Staworko, Michal
    [J]. SPA 2009: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2009, : 169 - +
  • [46] On the study of replay and voice conversion attacks to text-dependent speaker verification
    Wu, Zhizheng
    Li, Haizhou
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (09) : 5311 - 5327
  • [47] COMPARISON OF MULTIPLE FEATURES AND MODELING METHODS FOR TEXT-DEPENDENT SPEAKER VERIFICATION
    Liu, Yi
    He, Liang
    Tian, Yao
    Chen, Zhuzi
    Liu, Jia
    Johnson, Michael T.
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 629 - 636
  • [48] On the study of replay and voice conversion attacks to text-dependent speaker verification
    Zhizheng Wu
    Haizhou Li
    [J]. Multimedia Tools and Applications, 2016, 75 : 5311 - 5327
  • [49] DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION
    Dey, Subhadeep
    Madikeri, Srikanth
    Ferras, Marc
    Modicek, Petr
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5050 - 5054
  • [50] Evaluation of the I-vector System for Text-dependent Speaker Verification
    Li, Lin
    Guo, Huiyang
    Shang, Fengyi
    Hong, Qingyang
    Liu, Kai
    [J]. PROCEEDINGS OF 2017 11TH IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2017, : 60 - 63