UTILIZATION OF UNLABELED DEVELOPMENT DATA FOR SPEAKER VERIFICATION

Authors
Liu, Gang [1]
Yu, Chengzhu [1]
Shokouhi, Navid [1]
Misra, Abhinav [1]
Xing, Hua [1]
Hansen, John H. L. [1]
Affiliations
[1] Univ Texas Dallas, CRSS, Richardson, TX 75080 USA
Keywords
Clustering; Speaker verification; PLDA; i-Vector; Universal imposter clustering;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
State-of-the-art speaker verification systems model speaker identity by mapping i-Vectors onto a probabilistic linear discriminant analysis (PLDA) space. Compared to other modeling approaches (such as cosine distance scoring), PLDA provides a more efficient mechanism to separate speaker information from other sources of undesired variability and offers superior speaker verification performance. Unfortunately, this efficiency comes at the cost of requiring a large corpus of labeled development data, which is too expensive or unrealistic to obtain in many cases. This study investigates a potential solution to this challenge: effectively utilizing unlabeled development data through universal imposter clustering. The proposed method offers +21.9% and +34.6% relative gains over the baseline system on two publicly available corpora, respectively. These improvements demonstrate the effectiveness of the proposed method.
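The abstract does not describe the universal imposter clustering procedure itself, so the following is only a minimal sketch of the general idea it refers to: assigning pseudo-speaker labels to unlabeled i-Vectors so that a label-dependent back end such as PLDA could be trained on them. Spherical k-means is used here as a stand-in technique and is not the paper's algorithm. The function names `length_normalize`, `cosine_score`, and `pseudo_label`, along with all parameters, are hypothetical.

```python
import numpy as np


def length_normalize(x):
    """Scale each row vector to unit Euclidean norm."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.maximum(norms, 1e-12)


def cosine_score(a, b):
    """Cosine similarity between two i-Vectors (the simple scoring
    approach the abstract contrasts with PLDA)."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


def pseudo_label(ivectors, n_clusters, n_iter=20, seed=0):
    """Assign a pseudo-speaker label to each unlabeled i-Vector.

    Assumption: spherical k-means (cosine-based assignment) stands in
    for the clustering step; the paper's own method is not shown here.
    """
    rng = np.random.default_rng(seed)
    x = length_normalize(np.asarray(ivectors, dtype=float))
    # Initialize centroids from randomly chosen data points.
    centroids = x[rng.choice(len(x), size=n_clusters, replace=False)]
    for _ in range(n_iter):
        # Assign each vector to the centroid with the highest cosine similarity.
        labels = np.argmax(x @ centroids.T, axis=1)
        for k in range(n_clusters):
            members = x[labels == k]
            if len(members) > 0:  # keep the old centroid if a cluster empties
                centroids[k] = members.mean(axis=0)
        centroids = length_normalize(centroids)
    return np.argmax(x @ centroids.T, axis=1)
```

In such a setup, the returned labels would take the place of the speaker labels that a labeled development corpus normally provides; how many clusters to use and how to treat unreliable clusters are choices this sketch leaves open.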
Pages: 418-423
Page count: 6