UTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation

Cited by: 4
Authors
Zhang, Chunlei [1 ]
Bahmaninezhad, Fahimeh [1 ]
Ranjan, Shivesh [1 ]
Yu, Chengzhu [1 ]
Shokouhi, Navid [1 ]
Hansen, John H. L. [1 ]
Affiliation
[1] Univ Texas Dallas, CRSS, Erik Jonsson Sch Engn, Richardson, TX 75080 USA
Keywords
NIST SRE; speaker recognition; domain mismatch; i-vector; speaker clustering
DOI
10.21437/Interspeech.2017-555
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This study describes the systems submitted by the Center for Robust Speech Systems (CRSS) at the University of Texas at Dallas (UTD) to the 2016 National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE). We developed four UBM and DNN i-vector based speaker recognition systems with alternate data sets and feature representations. Given that the emphasis of NIST SRE 2016 is on language mismatch between training and enrollment/test data, so-called domain mismatch, in our system development we focused on: (i) utilizing unlabeled in-domain data for centralizing i-vectors to alleviate the domain mismatch; (ii) selecting the proper data sets and optimizing configurations for training LDA/PLDA; (iii) introducing a newly proposed dimension reduction technique which incorporates unlabeled in-domain data before PLDA training; (iv) unsupervised speaker clustering of unlabeled data and using the clustered data alone or with previous SREs for PLDA training; and finally (v) score calibration using unlabeled data with "pseudo" speaker labels generated from speaker clustering. NIST evaluations show that our proposed methods were very successful for the given task.
Pages: 1343-1347
Page count: 5
Related papers
50 in total
  • [31] THE I4U SYSTEM IN NIST 2008 SPEAKER RECOGNITION EVALUATION
    Li, Haizhou
    Ma, Bin
    Lee, Kong-Aik
    Sun, Hanwu
    Zhu, Donglai
    Sim, Khe Chai
    You, Changhuai
    Tong, Rong
    Kaerkkaeinen, Ismo
    Huang, Chien-Lin
    Pervouchine, Vladimir
    Guo, Wu
    Li, Yijie
    Dai, Lirong
    Nosratighods, Mohaddeseh
    Tharmarajah, Thiruvaran
    Epps, Julien
    Ambikairajah, Eliathamby
    Chng, Eng-Siong
    Schultz, Tanja
    Jin, Qin
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4201 - +
  • [32] The DKU-SMIIP System for NIST 2018 Speaker Recognition Evaluation
    Cai, Danwei
    Gai, Weicheng
    Li, Ming
    INTERSPEECH 2019, 2019, : 4370 - 4374
  • [33] VOICEAI SYSTEMS TO NIST SRE19 EVALUATION: ROBUST SPEAKER RECOGNITION ON CONVERSATIONAL TELEPHONE SPEECH
    Li, Rongjin
    Chen, Dongpeng
    Zhang, Weibin
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6459 - 6463
  • [34] The ELISA Systems for the NIST'99 evaluation in speaker detection and tracking
    Bimbot, F
    Blouet, R
    Bonastre, JF
    Caloz, G
    Cernocky, J
    Chollet, G
    Durou, G
    Fredouille, C
    Genoud, D
    Gravier, G
    Hennebert, J
    Kharroubi, J
    Magrin-Chagnolleau, I
    Merlin, T
    Mokbel, C
    Nedic, B
    Petrovska-Delacrétaz, D
    Pigeon, S
    Seck, M
    Verlinde, P
    Zouhal, M
    DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 143 - 153
  • [35] UTD-CRSS SUBMISSION FOR MGB-3 ARABIC DIALECT IDENTIFICATION: FRONT-END AND BACK-END ADVANCEMENTS ON BROADCAST SPEECH
    Bulut, Ahmet E.
    Zhang, Qian
    Zhang, Chunlei
    Bahmaninezhad, Fahimeh
    Hansen, John H. L.
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 360 - 367
  • [36] Corpora for the evaluation of speaker recognition systems
    Campbell, JP
    Reynolds, DA
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 829 - 832
  • [37] Corpora for the evaluation of speaker recognition systems
    Dep of Defense, Ft. Meade, United States
    ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, (829-832):
  • [38] Comparison of Voice Activity Detectors for Interview Speech in NIST Speaker Recognition Evaluation
    Yu, Hon-Bill
    Mak, Man-Wai
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2364 - +
  • [39] HLT-NUS Submission for 2019 NIST Multimedia Speaker Recognition Evaluation
    Das, Rohan Kumar
    Tao, Ruijie
    Yang, Jichen
    Rao, Wei
    Yu, Cheng
    Li, Haizhou
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 605 - 609
  • [40] The IIR NIST SRE 2008 and 2010 Summed Channel Speaker Recognition Systems
    Sun, Hanwu
    Ma, Bin
    Huang, Chien-Lin
    Trung Hieu Nguyen
    Li, Haizhou
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 366 - 369