Content Normalization for Text-dependent Speaker Verification

被引：7

作者：

Dey, Subhadeep ^{[1
,2
]}

Madikeri, Srikanth ^{[1
]}

Motlicek, Petr ^{[1
]}

Ferras, Marc ^{[1
]}

机构：

[1] Idiap Res Inst, Martigny, Switzerland

[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

来源：

18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017年

关键词：

speaker verification; i-vectors; content matching;

D O I：

10.21437/Interspeech.2017-1419

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Subspace based techniques, such as i-vector and Joint Factor Analysis (JFA) have shown to provide state-of-the-art performance for fixed phrase based text-dependent speaker verification. However, the error rates of such systems on the random digit task of RSR dataset are higher than that of Gaussian Mixture Model-Universal Background Model (GMM-UBM). In this paper, we aim at improving i-vector system by normalizing the content of the enrollment data to match the test data. We estimate i-vectors for each frames of a speech utterance (also called online i-vectors). The largest similarity scores across frames between enrollment and test are taken using these online i-vectors to obtain speaker verification scores. Experiments on Part3 of RSR corpora show that the proposed approach achieves 12% relative improvement in equal error rate over a GMM-UBM based baseline system.

引用

页码：1482 / 1486

页数：5

共 50 条

[41] Improving X-vector and PLDA for Text-dependent Speaker Verification
Chen, Zhuxin
Lin, Yue
[J]. INTERSPEECH 2020, 2020, : 726 - 730
[42] END-TO-END ATTENTION BASED TEXT-DEPENDENT SPEAKER VERIFICATION
Zhang, Shi-Xiong
Chen, Zhuo
Zhao, Yong
Li, Jinyu
Gong, Yifan
[J]. 2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 171 - 178
[43] Lexicon-Based Local Representation for Text-Dependent Speaker Verification
You, Hanxu
Li, Wei
Li, Lianqiang
Zhu, Jie
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (03): : 587 - 589
[44] Text-dependent Speaker Verification Using Word-based Scoring
Yao, Shengyu
Huang, Houjun
Zhou, Ruohua
Yan, Yonghong
[J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 314 - 318
[45] APPLICATION OF DYNAMIC TIME WARPING AND CEPSTROGRAMS TO TEXT-DEPENDENT SPEAKER VERIFICATION
Kaczmarek, Andrzej
Staworko, Michal
[J]. SPA 2009: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2009, : 169 - +
[46] On the study of replay and voice conversion attacks to text-dependent speaker verification
Wu, Zhizheng
Li, Haizhou
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (09) : 5311 - 5327
[47] COMPARISON OF MULTIPLE FEATURES AND MODELING METHODS FOR TEXT-DEPENDENT SPEAKER VERIFICATION
Liu, Yi
He, Liang
Tian, Yao
Chen, Zhuzi
Liu, Jia
Johnson, Michael T.
[J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 629 - 636
[48] On the study of replay and voice conversion attacks to text-dependent speaker verification
Zhizheng Wu
Haizhou Li
[J]. Multimedia Tools and Applications, 2016, 75 : 5311 - 5327
[49] DEEP NEURAL NETWORK BASED POSTERIORS FOR TEXT-DEPENDENT SPEAKER VERIFICATION
Dey, Subhadeep
Madikeri, Srikanth
Ferras, Marc
Modicek, Petr
[J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5050 - 5054
[50] Evaluation of the I-vector System for Text-dependent Speaker Verification
Li, Lin
Guo, Huiyang
Shang, Fengyi
Hong, Qingyang
Liu, Kai
[J]. PROCEEDINGS OF 2017 11TH IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2017, : 60 - 63

← 1 2 3 4 5 →