The JHU Speaker Recognition System for the VOiCES 2019 Challenge

Cited by: 24

Authors:

Snyder, David [1,2]

Villalba, Jesus [1]

Chen, Nanxin [1]

Povey, Daniel [1,2]

Sell, Gregory [2]

Dehak, Najim [1]

Khudanpur, Sanjeev [1,2]

Affiliations:

[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA

[2] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD 21218 USA

Source:

INTERSPEECH 2019 | 2019

Keywords:

speaker recognition; VOiCES Challenge 2019;

DOI:

10.21437/Interspeech.2019-2979

Chinese Library Classification (CLC) number:

R36 [Pathology]; R76 [Otorhinolaryngology];

Discipline classification code:

100104 ; 100213 ;

Abstract:

This paper describes the systems developed by the JHU team for the speaker recognition track of the 2019 VOiCES from a Distance Challenge. On this far-field task, we achieved good performance using systems based on state-of-the-art deep neural network (DNN) embeddings. In this paradigm, a DNN maps variable-length speech segments to speaker embeddings, called x-vectors, that are then classified using probabilistic linear discriminant analysis (PLDA). Our submissions were composed of three x-vector-based systems that differed primarily in the DNN architecture, temporal pooling mechanism, and training objective function. On the evaluation set, our best single-system submission used an extended time-delay architecture, and achieved 0.435 in actual DCF, the primary evaluation metric. A fusion of all three x-vector systems was our primary submission, and it obtained an actual DCF of 0.362.
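As a rough illustration of how a temporal pooling step can turn a variable-length segment into a fixed-size vector, the sketch below uses mean-plus-standard-deviation pooling. This is an assumption made for illustration only: the abstract does not say which pooling mechanisms the three systems used, and the function name `stats_pool` and the toy frame values are hypothetical.

```python
import math
from typing import List, Sequence


def stats_pool(frames: Sequence[Sequence[float]]) -> List[float]:
    """Pool a (num_frames x dim) sequence into one vector of length 2 * dim.

    Assumption: mean + population standard deviation per dimension.
    The abstract does not specify the pooling used by the JHU systems.
    """
    if not frames:
        raise ValueError("need at least one frame")
    n = len(frames)
    dim = len(frames[0])
    # Per-dimension mean over all frames.
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    # Per-dimension population standard deviation over all frames.
    stds = [
        math.sqrt(sum((f[d] - means[d]) ** 2 for f in frames) / n)
        for d in range(dim)
    ]
    return means + stds


# Hypothetical frame-level features: two segments of different lengths.
short_segment = [[1.0, 2.0], [3.0, 6.0]]
long_segment = [[1.0, 2.0], [3.0, 6.0], [1.0, 2.0], [3.0, 6.0]]

pooled_short = stats_pool(short_segment)
pooled_long = stats_pool(long_segment)
```

Both segments yield a vector of the same length regardless of how many frames they contain, which is the property that lets later layers and a back end such as PLDA operate on fixed-dimensional embeddings.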

Pages: 2468-2472

Page count: 5

