THE LEAP SPEAKER RECOGNITION SYSTEM FOR NIST SRE 2018 CHALLENGE

被引:0
|
作者
Ramoji, Shreyas [1 ]
Mohan, Anand [1 ]
Mysore, Bhargavram [2 ]
Bhatia, Anmol [3 ]
Singh, Prachi [1 ]
Vardhan, Harsha [1 ]
Ganapathy, Sriram [1 ]
机构
[1] Indian Inst Sci, Elect Engn, Learning & Extract Acoust Patterns LEAP Lab, Bengaluru, India
[2] North Carolina State Univ, Raleigh, NC USA
[3] Birla Inst Technol & Sci BITS Pilani, Pilani, Rajasthan, India
关键词
x-vectors; Speaker Diarization; PLDA scoring; Gaussian back-end; Dimensionality Reduction; Speaker Verification; SUPPORT VECTOR MACHINES; VERIFICATION; END;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The NIST Speaker Recognition Evaluation (SRE) 2018 challenge comprises an open evaluation of the text independent speaker verification task. This paper summarizes the LEAP speaker verification systems submitted to the NIST SRE 2018. For all the speaker verification approaches, the front-end feature extraction involved the use of neural embeddings from a time delay neural network (TDNN) trained on a speaker discrimination task. These features, called x vectors, are used in multiple ways for speaker verification task. In the first approach, the x-vectors with pre-processing and dimensionality reduction, are used with probabilistic linear discriminant analysis (PLDA) scoring. The second approach applies a speaker diarizanon scheme on the test segments containing multiple talkers before speaker verification scoring based on PLDA. The third system uses a local pairwise LDA model for pre-processing the x-vectors which are then scored using a Gaussian back-end. With experiments on the SRE 2018 database, we show that most of the systems achieved noticeable improvements over the NIST baseline in terms of the primary cost metric. Using a system fusion of the various approaches, we obtain significant improvements over the NIST official baseline (average relative improvements of 19.7% and 20.1% for the development and evaluation set respectively).
引用
下载
收藏
页码:5771 / 5775
页数:5
相关论文
共 50 条
  • [1] The NIST SRE Summed Channel Speaker Recognition System
    Sun, Hanwu
    Ma, Bin
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1111 - 1114
  • [2] Speaker recognition - The ATVS-UAM system at NIST SRE 05
    Gonzalez-Rodriguez, Joaquin
    Ramos-Castro, Daniel
    Toledano, Doroteo Torre
    Montero-Asenjo, Alberto
    Gonzalez-Dominguez, Javier
    Lopez-Moreno, Ignacio
    Fierrez-Aguilar, Julian
    Garcia-Romero, Daniel
    Ortega-Garcia, Javier
    IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2007, 22 (01) : 15 - 21
  • [3] Human Assisted Speaker Recognition In NIST SRE10
    Greenberg, Craig
    Martin, Alvin
    Brandschain, Linda
    Campbell, Joseph
    Cieri, Christopher
    Doddington, George
    Godfrey, John
    ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 180 - 185
  • [4] The 2018 NIST Speaker Recognition Evaluation
    Sadjadi, Seyed Omid
    Greenberg, Craig
    Singer, Elliot
    Reynolds, Douglas
    Mason, Lisa
    Hernandez-Cordero, Jaime
    INTERSPEECH 2019, 2019, : 1483 - 1487
  • [5] 13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE
    Matejka, Pavel
    Plchot, Oldrich
    Glembek, Ondrej
    Burget, Lukas
    Rohdin, Johan
    Zeinali, Hossein
    Mosner, Ladislav
    Silnova, Anna
    Novotny, Ondrej
    Diez, Mireia
    Cernocky, Jan Honza
    COMPUTER SPEECH AND LANGUAGE, 2020, 63
  • [6] Evaluation of a Fused FM and Cepstral-Based Speaker Recognition System on the NIST 2008 SRE
    Nosratighods, Mohaddeseh
    Thiruvaran, Tharmarajah
    Epps, Julien
    Ambikairajah, Eliathamby
    Ma, Bin
    Li, Haizhou
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4233 - +
  • [7] Study of Overlapped Speech Detection for NIST SRE Summed Channel Speaker Recognition
    Sun, Hanwu
    Ma, Bin
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2356 - +
  • [8] Uncertainty propagation for noise robust speaker recognition: the case of NIST-SRE
    Ribas, Dayana
    Vincent, Emmanuel
    Calvo, Jose Ramon
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3536 - 3540
  • [9] The IIR NIST SRE 2008 and 2010 Summed Channel Speaker Recognition Systems
    Sun, Hanwu
    Ma, Bin
    Huang, Chien-Lin
    Trung Hieu Nguyen
    Li, Haizhou
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 366 - 369
  • [10] The DKU-SMIIP System for NIST 2018 Speaker Recognition Evaluation
    Cai, Danwei
    Gai, Weicheng
    Li, Ming
    INTERSPEECH 2019, 2019, : 4370 - 4374