THE LEAP SPEAKER RECOGNITION SYSTEM FOR NIST SRE 2018 CHALLENGE

被引:0
|
作者
Ramoji, Shreyas [1 ]
Mohan, Anand [1 ]
Mysore, Bhargavram [2 ]
Bhatia, Anmol [3 ]
Singh, Prachi [1 ]
Vardhan, Harsha [1 ]
Ganapathy, Sriram [1 ]
机构
[1] Indian Inst Sci, Elect Engn, Learning & Extract Acoust Patterns LEAP Lab, Bengaluru, India
[2] North Carolina State Univ, Raleigh, NC USA
[3] Birla Inst Technol & Sci BITS Pilani, Pilani, Rajasthan, India
关键词
x-vectors; Speaker Diarization; PLDA scoring; Gaussian back-end; Dimensionality Reduction; Speaker Verification; SUPPORT VECTOR MACHINES; VERIFICATION; END;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The NIST Speaker Recognition Evaluation (SRE) 2018 challenge comprises an open evaluation of the text independent speaker verification task. This paper summarizes the LEAP speaker verification systems submitted to the NIST SRE 2018. For all the speaker verification approaches, the front-end feature extraction involved the use of neural embeddings from a time delay neural network (TDNN) trained on a speaker discrimination task. These features, called x vectors, are used in multiple ways for speaker verification task. In the first approach, the x-vectors with pre-processing and dimensionality reduction, are used with probabilistic linear discriminant analysis (PLDA) scoring. The second approach applies a speaker diarizanon scheme on the test segments containing multiple talkers before speaker verification scoring based on PLDA. The third system uses a local pairwise LDA model for pre-processing the x-vectors which are then scored using a Gaussian back-end. With experiments on the SRE 2018 database, we show that most of the systems achieved noticeable improvements over the NIST baseline in terms of the primary cost metric. Using a system fusion of the various approaches, we obtain significant improvements over the NIST official baseline (average relative improvements of 19.7% and 20.1% for the development and evaluation set respectively).
引用
下载
收藏
页码:5771 / 5775
页数:5
相关论文
共 50 条
  • [41] MICROSOFT SPEAKER DIARIZATION SYSTEM FOR THE VOXCELEB SPEAKER RECOGNITION CHALLENGE 2020
    Xiao, Xiong
    Kanda, Naoyuki
    Chen, Zhuo
    Zhou, Tianyan
    Yoshioka, Takuya
    Chen, Sanyuan
    Zhao, Yong
    Liu, Gang
    Wu, Yu
    Wu, Jian
    Liu, Shujie
    Li, Jinyu
    Gong, Yifan
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5824 - 5828
  • [42] The BUCEA Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2022
    Zhou, Ruohua
    Du, Yuxuan
    Hu, Chenlei
    arXiv, 2022,
  • [43] The NIST 1999 Speaker Recognition Evaluation - An overview
    Martin, A
    Przybocki, M
    DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 1 - 18
  • [44] A DENOISING AUTOENCODER FOR SPEAKER RECOGNITION. RESULTS ON THE MCE 2018 CHALLENGE
    Font, Roberto
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6016 - 6020
  • [45] Robust speaker recognition with cross-channel data: MIT-LL results on the 2006 NIST SRE auxiliary microphone task
    Sturim, D. E.
    Campbell, W. M.
    Reynolds, D. A.
    Dunn, R. B.
    Quatieri, T. F.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 49 - +
  • [46] The JHU Speaker Recognition System for the VOiCES 2019 Challenge
    Snyder, David
    Villalba, Jesus
    Chen, Nanxin
    Povey, Daniel
    Sell, Gregory
    Dehak, Najim
    Khudanpur, Sanjeev
    INTERSPEECH 2019, 2019, : 2468 - 2472
  • [47] XMU-TS SYSTEMS FOR NIST SRE19 CTS CHALLENGE
    Lu, Hao
    Zhou, Jianfeng
    Zhao, Miao
    Lei, Wendian
    Hong, Qingyang
    Li, Lin
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7569 - 7573
  • [48] Speaker diarization system on the 2007 NIST rich transcription meeting recognition evaluation
    Sun, Hanwu
    Nwe, Tin Lay
    Chin, Eugene
    Koh, Wei
    Bin, Ma
    Li, Haizhou
    MULTIMEDIA SYSTEMS AND APPLICATIONS X, 2007, 6777
  • [49] THU-EE System Fusion for the NIST 2012 Speaker Recognition Evaluation
    Zhang, Wei-Qiang
    Li, Zhi-Yi
    Liu, Weiwei
    Liu, Jia
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2473 - 2477
  • [50] NIST launches iris recognition challenge
    不详
    PHOTONICS SPECTRA, 2005, 39 (10) : 53 - 53