Nuance - Politecnico di Torino's 2016 NIST Speaker Recognition Evaluation System

被引:6
|
作者
Colibro, Daniele [1 ]
Vair, Claudio [1 ]
Dalmasso, Emanuele [1 ]
Farrell, Kevin [1 ]
Karvitsky, Gennady [1 ]
Cumani, Sandro [2 ]
Laface, Pietro [2 ]
机构
[1] Nuance Commun Inc, Burlington, MA 01803 USA
[2] Politecn Torino, Turin, Italy
关键词
Speaker Recognition; i-vector; PLDA; PSVM; AS-Norm; Top-Norm;
D O I
10.21437/Interspeech.2017-797
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the Nuance-Politecnico di Torino (NPT) speaker recognition system submitted to the NIST SRE16 evaluation campaign. Included are the results of post evaluation tests, focusing on the analysis of the performance of generative and discriminative classifiers, and of score normalization. The submitted system combines the results of four GMM-IVector models. two DNN-IVector models and a GMM-SVM acoustic system. Each system exploits acoustic front-end parameters that differ by feature type and dimension. We analyze the main components of our submission, which contributed to obtaining 8.1% EER and 0.532 actual. C-primary in the challenging SRE16 Fixed condition.
引用
收藏
页码:1338 / 1342
页数:5
相关论文
共 50 条
  • [21] The NIST 1999 Speaker Recognition Evaluation - An overview
    Martin, A
    Przybocki, M
    DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 1 - 18
  • [22] A Noise-Robust System for NIST 2012 Speaker Recognition Evaluation
    Ferrer, Luciana
    McLaren, Mitchell
    Scheffer, Nicolas
    Lei, Yun
    Graciarena, Martin
    Mitra, Vikramjit
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1980 - 1984
  • [23] Development of the Primary CRIM System for the NIST 2008 Speaker Recognition Evaluation
    Kenny, Patrick
    Dehak, Najim
    Ouellet, Pierre
    Gupta, Vishwa
    Dumouchel, Pierre
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1401 - 1404
  • [24] THE 14U SYSTEM IN NIST 2008 SPEAKER RECOGNITION EVALUATION
    Li, Haizhou
    Ma, Bin
    Lee, Kong-Aik
    Sun, Hanwu
    Zhu, Donglai
    Sim, Khe Chai
    You, Changhuai
    Tong, Rong
    Kaerkkaeinen, Ismo
    Huang, Chien-Lin
    Pervouchine, Vladimir
    Guo, Wu
    Li, Yijie
    Dai, Lirong
    Nosratighods, Mohaddeseh
    Tharmarajah, Thiruvaran
    Epps, Julien
    Ambikairajah, Eliathamby
    Chng, Eng-Siong
    Schultz, Tanja
    Jin, Qin
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4201 - +
  • [25] The DKU-SMIIP System for NIST 2018 Speaker Recognition Evaluation
    Cai, Danwei
    Gai, Weicheng
    Li, Ming
    INTERSPEECH 2019, 2019, : 4370 - 4374
  • [26] The contribution of cepstral and stylistic features to SRI's 2005 NIST speaker recognition evaluation system
    Ferrer, Luciana
    Shriberg, Elizabeth
    Kajarekar, Sachin S.
    Stolcke, Andreas
    Sonmez, Kemal
    Venkataraman, Anand
    Bratt, Harry
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 101 - 104
  • [27] The 14U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016
    Lee, K. A.
    Hautamaki, V.
    Kinnunen, T.
    Larcher, A.
    Zhang, C.
    Nautsch, A.
    Stafylakis, T.
    Liu, G.
    Rouvier, M.
    Rao, W.
    Alegre, F.
    Ma, J.
    Mak, M. W.
    Sarkar, A. K.
    Delgado, H.
    Saeidi, R.
    Aronowitz, H.
    Sizov, A.
    Sun, H.
    Nguyen, T. H.
    Wang, G.
    Ma, B.
    Vestman, V.
    Sahidullah, M.
    Halonen, M.
    Kanervisto, A.
    Le Lan, G.
    Bahmaninezhad, F.
    Isadskiy, S.
    Rathgeb, C.
    Busch, C.
    Tzimiropoulos, G.
    Qian, Q.
    Wang, Z.
    Zhao, Q.
    Wang, T.
    Li, H.
    Xue, J.
    Zhu, S.
    Jin, R.
    Zhao, T.
    Bousquet, P. -M
    Ajili, M.
    Kheder, W. B.
    Matrouf, D.
    Lim, Z. H.
    Xu, C.
    Xu, H.
    Xiao, X.
    Chng, E. S.
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1328 - 1332
  • [28] Speaker diarization system on the 2007 NIST rich transcription meeting recognition evaluation
    Sun, Hanwu
    Nwe, Tin Lay
    Chin, Eugene
    Koh, Wei
    Bin, Ma
    Li, Haizhou
    MULTIMEDIA SYSTEMS AND APPLICATIONS X, 2007, 6777
  • [29] THU-EE System Fusion for the NIST 2012 Speaker Recognition Evaluation
    Zhang, Wei-Qiang
    Li, Zhi-Yi
    Liu, Weiwei
    Liu, Jia
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2473 - 2477
  • [30] CRSS SYSTEMS FOR 2012 NIST SPEAKER RECOGNITION EVALUATION
    Hasan, Taufiq
    Sadjadi, Seyed Omid
    Liu, Gang
    Shokouhi, Navid
    Boril, Hynek
    Hansen, John H. L.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6783 - 6787