UTILIZATION OF UNLABELED DEVELOPMENT DATA FOR SPEAKER VERIFICATION

Cited by: 0

Authors:

Liu, Gang [1]

Yu, Chengzhu [1]

Shokouhi, Navid [1]

Misra, Abhinav [1]

Xing, Hua [1]

Hansen, John H. L. [1]

Affiliation:

[1] Univ Texas Dallas, CRSS, Richardson, TX 75080 USA

Source:

2014 IEEE Workshop on Spoken Language Technology (SLT 2014) | 2014

Keywords:

Clustering; Speaker verification; PLDA; i-Vector; Universal imposter clustering

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

State-of-the-art speaker verification systems model speaker identity by mapping i-Vectors onto a probabilistic linear discriminant analysis (PLDA) space. Compared to other modeling approaches (such as cosine distance scoring), PLDA provides a more efficient mechanism for separating speaker information from other sources of undesired variability and offers superior speaker verification performance. Unfortunately, this efficiency comes at the cost of requiring a large corpus of labeled development data, which is too expensive or unrealistic to collect in many cases. This study investigates a potential solution to this challenge: making effective use of unlabeled development data through universal imposter clustering. The proposed method offers relative gains of 21.9% and 34.6% over the baseline system on two publicly available corpora, respectively. This significant improvement demonstrates the effectiveness of the proposed method.
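
To make the idea of turning unlabeled development data into usable training labels more concrete, the following is a minimal sketch under stated assumptions. It is not the universal imposter clustering procedure from the paper, whose details this record does not give. It swaps in a simple greedy, threshold-based clustering built on cosine scoring (the scoring approach the abstract mentions as an alternative to PLDA). The function names `cosine_score` and `pseudo_label`, the threshold value, and the toy two-dimensional vectors are all hypothetical and chosen only for illustration.

```python
import math


def cosine_score(a, b):
    """Cosine similarity between two vectors (higher means more similar)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pseudo_label(vectors, threshold=0.8):
    """Greedy clustering stand-in (an assumption, not the paper's method).

    Each vector joins the first cluster whose representative (that
    cluster's first member) scores at or above `threshold`; otherwise
    it opens a new cluster. Returns one integer label per vector.
    """
    representatives = []
    labels = []
    for v in vectors:
        for idx, rep in enumerate(representatives):
            if cosine_score(v, rep) >= threshold:
                labels.append(idx)
                break
        else:
            # No existing cluster is close enough: start a new one.
            representatives.append(v)
            labels.append(len(representatives) - 1)
    return labels


# Toy stand-ins for i-Vectors; real i-Vectors have hundreds of dimensions.
toy = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
labels = pseudo_label(toy)
print(labels)
```

In a pipeline of this kind, the resulting cluster indices could serve as pseudo speaker labels when training a model that normally needs labeled data, such as PLDA. How well that works depends heavily on the clustering quality, which this sketch does not address.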

Pages: 418-423

Page count: 6
