The Impact of Data Dependence on Speaker Recognition Evaluation

被引:6
|
作者
Wu, Jin Chu [1 ]
Martin, Alvin F. [1 ]
Greenberg, Craig S. [1 ]
Kacker, Raghu N. [1 ]
机构
[1] NIST, Gaithersburg, MD 20899 USA
关键词
Bootstrap; data dependence; multinomial probability; resampling; speaker recognition; standard error (SE); ROC ANALYSIS; FINGERPRINT DATA; SAMPLE-SIZE; VERIFICATION; BOOTSTRAP; ERROR; NIST; SYSTEMS; CURVE;
D O I
10.1109/TASLP.2016.2614725
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The data dependence due to multiple use of the same subjects has impact on the standard error (SE) of the detection cost function (DCF) in speaker recognition evaluation. The DCF is defined as a weighted sum of the probabilities of type I and type II errors at a given threshold. A two-layer data structure is constructed: Target scores are grouped into target sets based on the dependence, and likewise for non-target scores. On account of the needed equal probabilities for scores being selected when resampling, target sets must contain the same number of target scores, and so must non-target sets. In addition to the bootstrap method with i.i.d. assumption, the nonparametric two-sample one-layer and two-layer bootstrap methods are carried out based on whether the resampling takes place only on sets, or subsequently on scores within the sets. Due to the stochastic nature of the bootstrap, the distributions of the SEs of the DCF estimated using the three different bootstrap methods are created and compared. After performing hypothesis testing, it is found that data dependence increases not only the SE but also the variation of the SE, and the two-layer bootstrap is more conservative than the one-layer bootstrap. The rationale regarding the different impacts of the three bootstrap methods on the estimated SEs is investigated.
引用
收藏
页码:5 / 18
页数:14
相关论文
共 50 条
  • [1] Statistical Analysis for Speaker Recognition Evaluation With Data Dependence and Three Score Distributions
    Wu, Jin Chu
    Kacker, Raghu N.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1 - 14
  • [2] Data Dependency on Measurement Uncertainties in Speaker Recognition Evaluation
    Wu, Jin Chu
    Martin, Alvin F.
    Greenberg, Craig S.
    Kacker, Raghu N.
    [J]. ACTIVE AND PASSIVE SIGNATURES III, 2012, 8382
  • [3] Significance Test with Data Dependency in Speaker Recognition Evaluation
    Wu, Jin Chu
    Martin, Alvin F.
    Greenberg, Craig S.
    Kacker, Raghu N.
    Stanford, Vincent M.
    [J]. ACTIVE AND PASSIVE SIGNATURES IV, 2013, 8734
  • [4] Evaluation on Data - Speaker Dependability Approaches for Speech Recognition Tasks
    Saod, Aini Hafizah Mohd
    Sulaiman, Siti Noraini
    Harron, Nur Athiqah
    Ahmad, Azizah
    Ramlan, Siti Azura
    Ramli, Dzati Athiar
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE 2012), 2012, : 254 - 258
  • [5] Automatic Speaker Recognition with Limited Data
    Li, Ruirui
    Jiang, Jyun-Yu
    Liu, Jiahao
    Hsieh, Chu-Cheng
    Wang, Wei
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 340 - 348
  • [6] The RedDots Data Collection for Speaker Recognition
    Lee, Kong Aik
    Larcher, Anthony
    Wang, Guangsen
    Kenny, Patrick
    Brummer, Niko
    van Leeuwen, David
    Aronowitz, Hagai
    Kockmann, Marcel
    Vaqueros, Carlos
    Ma, Bin
    Li, Haizhou
    Stafylakis, Themos
    Alam, Jahangir
    Swart, Albert
    Perez, Javier
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2996 - 3000
  • [7] Uncertainties of Measures in Speaker Recognition Evaluation
    Wu, Jin Chu
    Martin, Alvin F.
    Greenberg, Craig S.
    Kacker, Raghu N.
    [J]. ACTIVE AND PASSIVE SIGNATURES II, 2011, 8040
  • [8] The 2018 NIST Speaker Recognition Evaluation
    Sadjadi, Seyed Omid
    Greenberg, Craig
    Singer, Elliot
    Reynolds, Douglas
    Mason, Lisa
    Hernandez-Cordero, Jaime
    [J]. INTERSPEECH 2019, 2019, : 1483 - 1487
  • [9] The NIST 2010 Speaker Recognition Evaluation
    Martin, Alvin F.
    Greenberg, Craig S.
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2734 - 2737
  • [10] CCC speaker recognition evaluation 2006: Overview, methods, data, results and perspective
    Zheng, Thomas Fang
    Song, Zhanjiang
    Zhang, Lihong
    Brasser, Michael
    Wu, Wei
    Deng, Jing
    [J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 485 - 493