THE LEAP SPEAKER RECOGNITION SYSTEM FOR NIST SRE 2018 CHALLENGE

被引：0

作者：

Ramoji, Shreyas ^{[1
]}

Mohan, Anand ^{[1
]}

Mysore, Bhargavram ^{[2
]}

Bhatia, Anmol ^{[3
]}

Singh, Prachi ^{[1
]}

Vardhan, Harsha ^{[1
]}

Ganapathy, Sriram ^{[1
]}

机构：

[1] Indian Inst Sci, Elect Engn, Learning & Extract Acoust Patterns LEAP Lab, Bengaluru, India

[2] North Carolina State Univ, Raleigh, NC USA

[3] Birla Inst Technol & Sci BITS Pilani, Pilani, Rajasthan, India

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年

关键词：

x-vectors; Speaker Diarization; PLDA scoring; Gaussian back-end; Dimensionality Reduction; Speaker Verification; SUPPORT VECTOR MACHINES; VERIFICATION; END;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The NIST Speaker Recognition Evaluation (SRE) 2018 challenge comprises an open evaluation of the text independent speaker verification task. This paper summarizes the LEAP speaker verification systems submitted to the NIST SRE 2018. For all the speaker verification approaches, the front-end feature extraction involved the use of neural embeddings from a time delay neural network (TDNN) trained on a speaker discrimination task. These features, called x vectors, are used in multiple ways for speaker verification task. In the first approach, the x-vectors with pre-processing and dimensionality reduction, are used with probabilistic linear discriminant analysis (PLDA) scoring. The second approach applies a speaker diarizanon scheme on the test segments containing multiple talkers before speaker verification scoring based on PLDA. The third system uses a local pairwise LDA model for pre-processing the x-vectors which are then scored using a Gaussian back-end. With experiments on the SRE 2018 database, we show that most of the systems achieved noticeable improvements over the NIST baseline in terms of the primary cost metric. Using a system fusion of the various approaches, we obtain significant improvements over the NIST official baseline (average relative improvements of 19.7% and 20.1% for the development and evaluation set respectively).

引用

下载

页码：5771 / 5775

页数：5

共 50 条

[1] The NIST SRE Summed Channel Speaker Recognition System
Sun, Hanwu
Ma, Bin
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1111 - 1114
[2] Speaker recognition - The ATVS-UAM system at NIST SRE 05
Gonzalez-Rodriguez, Joaquin
Ramos-Castro, Daniel
Toledano, Doroteo Torre
Montero-Asenjo, Alberto
Gonzalez-Dominguez, Javier
Lopez-Moreno, Ignacio
Fierrez-Aguilar, Julian
Garcia-Romero, Daniel
Ortega-Garcia, Javier
IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2007, 22 (01) : 15 - 21
[3] Human Assisted Speaker Recognition In NIST SRE10
Greenberg, Craig
Martin, Alvin
Brandschain, Linda
Campbell, Joseph
Cieri, Christopher
Doddington, George
Godfrey, John
ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 180 - 185
[4] The 2018 NIST Speaker Recognition Evaluation
Sadjadi, Seyed Omid
Greenberg, Craig
Singer, Elliot
Reynolds, Douglas
Mason, Lisa
Hernandez-Cordero, Jaime
INTERSPEECH 2019, 2019, : 1483 - 1487
[5] 13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE
Matejka, Pavel
Plchot, Oldrich
Glembek, Ondrej
Burget, Lukas
Rohdin, Johan
Zeinali, Hossein
Mosner, Ladislav
Silnova, Anna
Novotny, Ondrej
Diez, Mireia
Cernocky, Jan Honza
COMPUTER SPEECH AND LANGUAGE, 2020, 63
[6] Evaluation of a Fused FM and Cepstral-Based Speaker Recognition System on the NIST 2008 SRE
Nosratighods, Mohaddeseh
Thiruvaran, Tharmarajah
Epps, Julien
Ambikairajah, Eliathamby
Ma, Bin
Li, Haizhou
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4233 - +
[7] Study of Overlapped Speech Detection for NIST SRE Summed Channel Speaker Recognition
Sun, Hanwu
Ma, Bin
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2356 - +
[8] Uncertainty propagation for noise robust speaker recognition: the case of NIST-SRE
Ribas, Dayana
Vincent, Emmanuel
Calvo, Jose Ramon
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3536 - 3540
[9] The IIR NIST SRE 2008 and 2010 Summed Channel Speaker Recognition Systems
Sun, Hanwu
Ma, Bin
Huang, Chien-Lin
Trung Hieu Nguyen
Li, Haizhou
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 366 - 369
[10] The DKU-SMIIP System for NIST 2018 Speaker Recognition Evaluation
Cai, Danwei
Gai, Weicheng
Li, Ming
INTERSPEECH 2019, 2019, : 4370 - 4374

← 1 2 3 4 5 →