The Impact of Data Dependence on Speaker Recognition Evaluation

被引：6

作者：

Wu, Jin Chu ^{[1
]}

Martin, Alvin F. ^{[1
]}

Greenberg, Craig S. ^{[1
]}

Kacker, Raghu N. ^{[1
]}

机构：

[1] NIST, Gaithersburg, MD 20899 USA

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2017年 / 25卷 / 01期

关键词：

Bootstrap; data dependence; multinomial probability; resampling; speaker recognition; standard error (SE); ROC ANALYSIS; FINGERPRINT DATA; SAMPLE-SIZE; VERIFICATION; BOOTSTRAP; ERROR; NIST; SYSTEMS; CURVE;

D O I：

10.1109/TASLP.2016.2614725

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The data dependence due to multiple use of the same subjects has impact on the standard error (SE) of the detection cost function (DCF) in speaker recognition evaluation. The DCF is defined as a weighted sum of the probabilities of type I and type II errors at a given threshold. A two-layer data structure is constructed: Target scores are grouped into target sets based on the dependence, and likewise for non-target scores. On account of the needed equal probabilities for scores being selected when resampling, target sets must contain the same number of target scores, and so must non-target sets. In addition to the bootstrap method with i.i.d. assumption, the nonparametric two-sample one-layer and two-layer bootstrap methods are carried out based on whether the resampling takes place only on sets, or subsequently on scores within the sets. Due to the stochastic nature of the bootstrap, the distributions of the SEs of the DCF estimated using the three different bootstrap methods are created and compared. After performing hypothesis testing, it is found that data dependence increases not only the SE but also the variation of the SE, and the two-layer bootstrap is more conservative than the one-layer bootstrap. The rationale regarding the different impacts of the three bootstrap methods on the estimated SEs is investigated.

引用

页码：5 / 18

页数：14

共 50 条

[1] Statistical Analysis for Speaker Recognition Evaluation With Data Dependence and Three Score Distributions
Wu, Jin Chu
Kacker, Raghu N.
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1 - 14
[2] Data Dependency on Measurement Uncertainties in Speaker Recognition Evaluation
Wu, Jin Chu
Martin, Alvin F.
Greenberg, Craig S.
Kacker, Raghu N.
[J]. ACTIVE AND PASSIVE SIGNATURES III, 2012, 8382
[3] Significance Test with Data Dependency in Speaker Recognition Evaluation
Wu, Jin Chu
Martin, Alvin F.
Greenberg, Craig S.
Kacker, Raghu N.
Stanford, Vincent M.
[J]. ACTIVE AND PASSIVE SIGNATURES IV, 2013, 8734
[4] Evaluation on Data - Speaker Dependability Approaches for Speech Recognition Tasks
Saod, Aini Hafizah Mohd
Sulaiman, Siti Noraini
Harron, Nur Athiqah
Ahmad, Azizah
Ramlan, Siti Azura
Ramli, Dzati Athiar
[J]. 2012 IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE 2012), 2012, : 254 - 258
[5] Automatic Speaker Recognition with Limited Data
Li, Ruirui
Jiang, Jyun-Yu
Liu, Jiahao
Hsieh, Chu-Cheng
Wang, Wei
[J]. PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 340 - 348
[6] The RedDots Data Collection for Speaker Recognition
Lee, Kong Aik
Larcher, Anthony
Wang, Guangsen
Kenny, Patrick
Brummer, Niko
van Leeuwen, David
Aronowitz, Hagai
Kockmann, Marcel
Vaqueros, Carlos
Ma, Bin
Li, Haizhou
Stafylakis, Themos
Alam, Jahangir
Swart, Albert
Perez, Javier
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2996 - 3000
[7] Uncertainties of Measures in Speaker Recognition Evaluation
Wu, Jin Chu
Martin, Alvin F.
Greenberg, Craig S.
Kacker, Raghu N.
[J]. ACTIVE AND PASSIVE SIGNATURES II, 2011, 8040
[8] The 2018 NIST Speaker Recognition Evaluation
Sadjadi, Seyed Omid
Greenberg, Craig
Singer, Elliot
Reynolds, Douglas
Mason, Lisa
Hernandez-Cordero, Jaime
[J]. INTERSPEECH 2019, 2019, : 1483 - 1487
[9] The NIST 2010 Speaker Recognition Evaluation
Martin, Alvin F.
Greenberg, Craig S.
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2734 - 2737
[10] CCC speaker recognition evaluation 2006: Overview, methods, data, results and perspective
Zheng, Thomas Fang
Song, Zhanjiang
Zhang, Lihong
Brasser, Michael
Wu, Wei
Deng, Jing
[J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 485 - 493

← 1 2 3 4 5 →