Content Normalization for Text-dependent Speaker Verification

被引：7

作者：

Dey, Subhadeep ^{[1
,2
]}

Madikeri, Srikanth ^{[1
]}

Motlicek, Petr ^{[1
]}

Ferras, Marc ^{[1
]}

机构：

[1] Idiap Res Inst, Martigny, Switzerland

[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

来源：

18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017年

关键词：

speaker verification; i-vectors; content matching;

D O I：

10.21437/Interspeech.2017-1419

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Subspace based techniques, such as i-vector and Joint Factor Analysis (JFA) have shown to provide state-of-the-art performance for fixed phrase based text-dependent speaker verification. However, the error rates of such systems on the random digit task of RSR dataset are higher than that of Gaussian Mixture Model-Universal Background Model (GMM-UBM). In this paper, we aim at improving i-vector system by normalizing the content of the enrollment data to match the test data. We estimate i-vectors for each frames of a speech utterance (also called online i-vectors). The largest similarity scores across frames between enrollment and test are taken using these online i-vectors to obtain speaker verification scores. Experiments on Part3 of RSR corpora show that the proposed approach achieves 12% relative improvement in equal error rate over a GMM-UBM based baseline system.

引用

页码：1482 / 1486

页数：5

共 50 条

[31] Multi-Task Learning for Text-dependent Speaker Verification
Chen, Nanxin
Qian, Yanmin
Yu, Kai
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 185 - 189
[32] EXPLORING SEQUENTIAL CHARACTERISTICS IN SPEAKER BOTTLENECK FEATURE FOR TEXT-DEPENDENT SPEAKER VERIFICATION
Chen, Liping
Zhao, Yong
Zhang, Shi-Xiong
Li, Jie
Ye, Guoli
Soong, Frank
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5364 - 5368
[33] Addressing Text-Dependent Speaker Verification Using Singing Speech
Shi, Yan
Zhou, Juanjuan
Long, Yanhua
Li, Yijie
Mao, Hongwei
[J]. APPLIED SCIENCES-BASEL, 2019, 9 (13):
[34] ON INSTANTANEOUS AND TRANSITIONAL SPECTRAL INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION
BERNASCONI, C
[J]. SPEECH COMMUNICATION, 1990, 9 (02) : 129 - 139
[35] EFFECTS OF GENDER INFORMATION IN TEXT-INDEPENDENT AND TEXT-DEPENDENT SPEAKER VERIFICATION
Kanervisto, Anssi
Vestman, Ville
Sahidullah, Md
Hautamaki, Ville
Kinnunen, Tomi
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5360 - 5364
[36] Speaker-dependent Dictionary-based Speech Enhancement for Text-Dependent Speaker Verification
Thomsen, Nicolai Baek
Thomsen, Dennis Alexander Lehmann
Tan, Zheng-Hua
Lindberg, Borge
Jensen, Soren Holdt
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1839 - 1843
[37] Weighting scores to improve speaker-dependent threshold estimation in text-dependent speaker verification
Saeta, JR
Hernando, J
[J]. NONLINEAR ANALYSES AND ALGORITHMS FOR SPEECH PROCESSING, 2005, 3817 : 81 - 91
[38] Text-dependent speaker verification: Classifiers, databases and RSR2015
Larcher, Anthony
Lee, Kong Aik
Ma, Bin
Li, Haizhou
[J]. SPEECH COMMUNICATION, 2014, 60 : 56 - 77
[39] Parameterization of the score threshold for a text-dependent adaptive speaker verification system
Mirghafori, N
Hébert, M
[J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 361 - 364
[40] DEEP NEURAL NETWORKS FOR SMALL FOOTPRINT TEXT-DEPENDENT SPEAKER VERIFICATION
Variani, Ehsan
Lei, Xin
McDermott, Erik
Moreno, Ignacio Lopez
Gonzalez-Dominguez, Javier
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,

← 1 2 3 4 5 →