The Relevance of NIST Speaker Recognition Evaluations

被引:0
|
作者
Asha, T. [1 ]
Murthy, Hema A. [1 ]
机构
[1] Indian Inst Technol, Madras 600036, Tamil Nadu, India
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Feature extraction and building of the Universal Background Model (UBM) are crucial for building speaker verification/identification systems in the total variability subspace (TVS) framework. The motivation of this study is to analyze the significance of various parameters involved in front end processing for different databases. A number of different parameters like energy threshold for voice activity detection, the number of filters, the warping of the frequency scale, the number of cepstral coefficients and the shape of the filter are studied. Three different databases namely, NIST 2003, NisT 2010 and NTIMIT are studied. The optimal front-end obtained using NIST 2003 is observed to function well for NIST 2010 as conditions involving similar data was evaluated for both the databases. On the other hand, it is shown that the same optimal front-end is not scalable for NTIMIT database which is collected from a different environment. The experiments performed in this paper indicate that the optimal front-end parameters are specific to a particular dataset. In addition, mismatch between development data and evaluation data is shown to result in a poor system. Given the results, the paper questions the relevance of the NIST Speaker Recognition evaluations in real environments.
引用
下载
收藏
页数:6
相关论文
共 50 条
  • [31] The NIST speaker recognition evaluation - Overview, methodology, systems, results, perspective
    Doddington, GR
    Przybocki, MA
    Martin, AF
    Reynolds, DA
    SPEECH COMMUNICATION, 2000, 31 (2-3) : 225 - 254
  • [32] Speaker recognition - The ATVS-UAM system at NIST SRE 05
    Gonzalez-Rodriguez, Joaquin
    Ramos-Castro, Daniel
    Toledano, Doroteo Torre
    Montero-Asenjo, Alberto
    Gonzalez-Dominguez, Javier
    Lopez-Moreno, Ignacio
    Fierrez-Aguilar, Julian
    Garcia-Romero, Daniel
    Ortega-Garcia, Javier
    IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2007, 22 (01) : 15 - 21
  • [33] A Noise-Robust System for NIST 2012 Speaker Recognition Evaluation
    Ferrer, Luciana
    McLaren, Mitchell
    Scheffer, Nicolas
    Lei, Yun
    Graciarena, Martin
    Mitra, Vikramjit
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1980 - 1984
  • [34] 13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE
    Matejka, Pavel
    Plchot, Oldrich
    Glembek, Ondrej
    Burget, Lukas
    Rohdin, Johan
    Zeinali, Hossein
    Mosner, Ladislav
    Silnova, Anna
    Novotny, Ondrej
    Diez, Mireia
    Cernocky, Jan Honza
    COMPUTER SPEECH AND LANGUAGE, 2020, 63
  • [35] Development of the Primary CRIM System for the NIST 2008 Speaker Recognition Evaluation
    Kenny, Patrick
    Dehak, Najim
    Ouellet, Pierre
    Gupta, Vishwa
    Dumouchel, Pierre
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1401 - 1404
  • [36] THE 14U SYSTEM IN NIST 2008 SPEAKER RECOGNITION EVALUATION
    Li, Haizhou
    Ma, Bin
    Lee, Kong-Aik
    Sun, Hanwu
    Zhu, Donglai
    Sim, Khe Chai
    You, Changhuai
    Tong, Rong
    Kaerkkaeinen, Ismo
    Huang, Chien-Lin
    Pervouchine, Vladimir
    Guo, Wu
    Li, Yijie
    Dai, Lirong
    Nosratighods, Mohaddeseh
    Tharmarajah, Thiruvaran
    Epps, Julien
    Ambikairajah, Eliathamby
    Chng, Eng-Siong
    Schultz, Tanja
    Jin, Qin
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4201 - +
  • [37] The DKU-SMIIP System for NIST 2018 Speaker Recognition Evaluation
    Cai, Danwei
    Gai, Weicheng
    Li, Ming
    INTERSPEECH 2019, 2019, : 4370 - 4374
  • [38] UTD-CRSS SYSTEMS FOR 2018 NIST SPEAKER RECOGNITION EVALUATION
    Zhang, Chunlei
    Bahmaninezhad, Fahimeh
    Ranjan, Shivesh
    Dubey, Harishchandra
    Xia, Wei
    Hansen, John H. L.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5776 - 5780
  • [40] Uncertainty propagation for noise robust speaker recognition: the case of NIST-SRE
    Ribas, Dayana
    Vincent, Emmanuel
    Calvo, Jose Ramon
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3536 - 3540