UTILIZATION OF UNLABELED DEVELOPMENT DATA FOR SPEAKER VERIFICATION

被引:0
|
作者
Liu, Gang [1 ]
Yu, Chengzhu [1 ]
Shokouhi, Navid [1 ]
Misra, Abhinav [1 ]
Xing, Hua [1 ]
Hansen, John H. L. [1 ]
机构
[1] Univ Texas Dallas, CRSS, Richardson, TX 75080 USA
关键词
Clustering; Speaker verification; PLDA; i-Vector; Universal imposter clustering;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State-of-the-art speaker verification systems model speaker identity by mapping i-Vectors onto a probabilistic linear discriminant analysis (PLDA) space. Compared to other modeling approaches (such as cosine distance scoring), PLDA provides a more efficient mechanism to separate speaker information from other sources of undesired variabilities and offers superior speaker verification performance. Unfortunately, this efficiency is obtained at the cost of a required large corpus of labeled development data, which is too expensive/unrealistic in many cases. This study investigates a potential solution to resolve this challenge by effectively utilizing unlabeled development data with universal imposter clustering. The proposed method offers +21.9% and +34.6% relative gains versus the baseline system on two public available corpora, respectively. This significant improvement proves the effectiveness of the proposed method.
引用
收藏
页码:418 / 423
页数:6
相关论文
共 50 条
  • [41] Secure Computation for Biometric Data Security-Application to Speaker Verification
    Sy, Bon K.
    IEEE SYSTEMS JOURNAL, 2009, 3 (04): : 451 - 460
  • [42] Data-driven temporal filters and alternatives to GMM in speaker verification
    Malayath, N
    Hermansky, H
    Kajarekar, S
    Yegnanarayana, B
    DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 55 - 74
  • [43] HMM SPEAKER VERIFICATION WITH SPARSE TRAINING DATA ON TELEPHONE QUALITY SPEECH
    FORSYTH, ME
    SUTHERLAND, AM
    ELLIOTT, JA
    JACK, MA
    SPEECH COMMUNICATION, 1993, 13 (3-4) : 411 - 416
  • [44] Enrollee-constrained sparse coding of test data for speaker verification
    Kumar, Nagendra
    Sinha, Rohit
    PATTERN RECOGNITION LETTERS, 2018, 116 : 15 - 21
  • [45] A comparison of various adaptation methods for speaker verification with limited enrollment data
    Mak, Man-Wai
    Hsiao, Roger
    Mak, Brian
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 929 - 932
  • [46] ROBUST SPEAKER VERIFICATION USING POPULATION-BASED DATA AUGMENTATION
    Lin, Weiwei
    Mak, Man-Wai
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7642 - 7646
  • [47] On Leveraging Conversational Data for Building a Text Dependent Speaker Verification System
    Aronowitz, Hagai
    Barkan, Oren
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2469 - 2472
  • [48] Exploring Acoustic Factor Analysis for Limited Test Data Speaker Verification
    Mamodiya, Salil
    Kumar, Lay
    Das, Rohan Kumar
    Prasanna, S. R. Mahadeva
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 1397 - 1401
  • [49] System Source and Dynamic Features for Speaker Verification for Limited Data Condition
    Kumari, T. R. Jayanthi
    Jayanna, H. S.
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 1458 - 1461
  • [50] Investigating the Effect of Data Partitioning for GMM Supervector Based Speaker Verification
    Dikici, Erinc
    Saraclar, Murat
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 464 - 469