The JHU Speaker Recognition System for the VOiCES 2019 Challenge

被引:24
|
作者
Snyder, David [1 ,2 ]
Villalba, Jesus [1 ]
Chen, Nanxin [1 ]
Povey, Daniel [1 ,2 ]
Sell, Gregory [2 ]
Dehak, Najim [1 ]
Khudanpur, Sanjeev [1 ,2 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD 21218 USA
来源
关键词
speaker recognition; VOiCES Challenge 2019;
D O I
10.21437/Interspeech.2019-2979
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
This paper describes the systems developed by the JHU team for the speaker recognition track of the 2019 VOiCES from a Distance Challenge. On this far-field task, we achieved good performance using systems based on state-of-the-art deep neural network (DNN) embeddings. In this paradigm, a DNN maps variable-length speech segments to speaker embeddings, called x-vectors, that are then classified using probabilistic linear discriminant analysis (PLDA). Our submissions were composed of three x-vector-based systems that differed primarily in the DNN architecture, temporal pooling mechanism, and training objective function. On the evaluation set, our best single-system submission used an extended time-delay architecture, and achieved 0.435 in actual DCF, the primary evaluation metric. A fusion of all three x-vector systems was our primary submission, and it obtained an actual DCF of 0.362.
引用
收藏
页码:2468 / 2472
页数:5
相关论文
共 50 条
  • [41] Design and implementation of a speaker recognition system
    Han, Y.
    Chen, L.H.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science Edition, 2001, 35 (02):
  • [42] A CEPSTRAL BASED SPEAKER RECOGNITION SYSTEM
    SETHURAMAN, R
    GOWDY, JN
    PROCEEDINGS : THE TWENTY-FIRST SOUTHEASTERN SYMPOSIUM ON SYSTEM THEORY, 1989, : 503 - 507
  • [43] A COMBINED SPEAKER AND DIGIT RECOGNITION SYSTEM
    JAMALI, MM
    MOHANKRISHNAN, N
    SHRIDHAR, M
    OHIO JOURNAL OF SCIENCE, 1987, 87 (02) : 33 - 33
  • [44] Performance evaluation of speaker recognition system
    Palia, Nivedita
    Kant, Shri
    Dev, Amita
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2019, 22 (02): : 203 - 218
  • [45] Research on robust speaker recognition system
    Yingjun, Lu
    Cancan, Chong
    Anling, Xu
    Maoyong, Cao
    2007 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY, PROCEEDINGS, 2007, : 698 - 701
  • [46] WORD AND SPEAKER RECOGNITION SYSTEM ON MATLAB
    Fei, Tan Shwu
    Awan, Mohammad
    PROCEEDINGS OF THE 2011 3RD INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGY AND ENGINEERING (ICSTE 2011), 2011, : 19 - 23
  • [47] Speaker Recognition System for Security Applications
    Selvan, Karthik
    Joseph, Aju
    Babu, Anish K. K.
    2013 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2013, : 26 - 30
  • [48] State-of-the-art Speaker Recognition for Telephone and Video Speech: the JHU-MIT Submission for NIST SRE18
    Villalba, Jesus
    Chen, Nanxin
    Snyder, David
    Garcia-Romero, Daniel
    McCree, Alan
    Sell, Gregory
    Borgstrom, Jonas
    Richardson, Fred
    Shon, Suwon
    Grondin, Francois
    Dehak, Reda
    Garcia-Perera, Leibny Paola
    Povey, Daniel
    Torres-Carrasquillo, Pedro A.
    Khudanpur, Sanjeev
    Dehak, Najim
    INTERSPEECH 2019, 2019, : 1488 - 1492
  • [49] Speaker adaptation by modeling the speaker variation in a continuous speech recognition system
    Strom, N
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 989 - 992
  • [50] STC-innovation Speaker Recognition Systems for Far-Field Speaker Verification Challenge 2020
    Gusev, Aleksei
    Volokhov, Vladimir
    Vinogradova, Alisa
    Andzhukaev, Tseren
    Shulipa, Andrey
    Novoselov, Sergey
    Pekhovsky, Timur
    Kozlov, Alexander
    INTERSPEECH 2020, 2020, : 3466 - 3470