THE LEAP SPEAKER RECOGNITION SYSTEM FOR NIST SRE 2018 CHALLENGE

被引：0

作者：

Ramoji, Shreyas ^{[1
]}

Mohan, Anand ^{[1
]}

Mysore, Bhargavram ^{[2
]}

Bhatia, Anmol ^{[3
]}

Singh, Prachi ^{[1
]}

Vardhan, Harsha ^{[1
]}

Ganapathy, Sriram ^{[1
]}

机构：

[1] Indian Inst Sci, Elect Engn, Learning & Extract Acoust Patterns LEAP Lab, Bengaluru, India

[2] North Carolina State Univ, Raleigh, NC USA

[3] Birla Inst Technol & Sci BITS Pilani, Pilani, Rajasthan, India

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年

关键词：

x-vectors; Speaker Diarization; PLDA scoring; Gaussian back-end; Dimensionality Reduction; Speaker Verification; SUPPORT VECTOR MACHINES; VERIFICATION; END;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The NIST Speaker Recognition Evaluation (SRE) 2018 challenge comprises an open evaluation of the text independent speaker verification task. This paper summarizes the LEAP speaker verification systems submitted to the NIST SRE 2018. For all the speaker verification approaches, the front-end feature extraction involved the use of neural embeddings from a time delay neural network (TDNN) trained on a speaker discrimination task. These features, called x vectors, are used in multiple ways for speaker verification task. In the first approach, the x-vectors with pre-processing and dimensionality reduction, are used with probabilistic linear discriminant analysis (PLDA) scoring. The second approach applies a speaker diarizanon scheme on the test segments containing multiple talkers before speaker verification scoring based on PLDA. The third system uses a local pairwise LDA model for pre-processing the x-vectors which are then scored using a Gaussian back-end. With experiments on the SRE 2018 database, we show that most of the systems achieved noticeable improvements over the NIST baseline in terms of the primary cost metric. Using a system fusion of the various approaches, we obtain significant improvements over the NIST official baseline (average relative improvements of 19.7% and 20.1% for the development and evaluation set respectively).

引用

下载

页码：5771 / 5775

页数：5

共 50 条

[41] MICROSOFT SPEAKER DIARIZATION SYSTEM FOR THE VOXCELEB SPEAKER RECOGNITION CHALLENGE 2020
Xiao, Xiong
Kanda, Naoyuki
Chen, Zhuo
Zhou, Tianyan
Yoshioka, Takuya
Chen, Sanyuan
Zhao, Yong
Liu, Gang
Wu, Yu
Wu, Jian
Liu, Shujie
Li, Jinyu
Gong, Yifan
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5824 - 5828
[42] The BUCEA Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2022
Zhou, Ruohua
Du, Yuxuan
Hu, Chenlei
arXiv, 2022,
[43] The NIST 1999 Speaker Recognition Evaluation - An overview
Martin, A
Przybocki, M
DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 1 - 18
[44] A DENOISING AUTOENCODER FOR SPEAKER RECOGNITION. RESULTS ON THE MCE 2018 CHALLENGE
Font, Roberto
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6016 - 6020
[45] Robust speaker recognition with cross-channel data: MIT-LL results on the 2006 NIST SRE auxiliary microphone task
Sturim, D. E.
Campbell, W. M.
Reynolds, D. A.
Dunn, R. B.
Quatieri, T. F.
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 49 - +
[46] The JHU Speaker Recognition System for the VOiCES 2019 Challenge
Snyder, David
Villalba, Jesus
Chen, Nanxin
Povey, Daniel
Sell, Gregory
Dehak, Najim
Khudanpur, Sanjeev
INTERSPEECH 2019, 2019, : 2468 - 2472
[47] XMU-TS SYSTEMS FOR NIST SRE19 CTS CHALLENGE
Lu, Hao
Zhou, Jianfeng
Zhao, Miao
Lei, Wendian
Hong, Qingyang
Li, Lin
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7569 - 7573
[48] Speaker diarization system on the 2007 NIST rich transcription meeting recognition evaluation
Sun, Hanwu
Nwe, Tin Lay
Chin, Eugene
Koh, Wei
Bin, Ma
Li, Haizhou
MULTIMEDIA SYSTEMS AND APPLICATIONS X, 2007, 6777
[49] THU-EE System Fusion for the NIST 2012 Speaker Recognition Evaluation
Zhang, Wei-Qiang
Li, Zhi-Yi
Liu, Weiwei
Liu, Jia
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2473 - 2477
[50] NIST launches iris recognition challenge
不详
PHOTONICS SPECTRA, 2005, 39 (10) : 53 - 53

← 1 2 3 4 5 →