Discussion on score normalization and language robustness in text-independent multi-language speaker verification

被引：0

作者：

Zhao, Jian ^{[1
]}

Dong, Yuan ^{[1
,2
]}

Zhao, Xianyu ^{[2
]}

Yang, Hao ^{[1
]}

Lu, Liang ^{[1
]}

Wang, Haila ^{[2
]}

机构：

[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China

[2] France Telecom Res & Dev Ctr, Beijing 100080, Peoples R China

来源：

ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES | 2007年 / 4681卷

关键词：

score normalization; speaker adaptive test normalization; language; robustness; cross similarity measurement; speaker verification; NIST; 06; speaker; recognition evaluation;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In speaker recognition fields, score normalization is a widely used and effective technique to enhance the recognition performances and is developing further. In this paper, we are focused on the comparison among many kinds of candidates of score normalization methods and a new implementation of the speaker adaptive test normalization (ATnorm) based on a cross similarity measurement is presented which doesn't need an extra corpus for speaker adaptive impostor cohort selection. The use of ATnorm for the language robustness of the multi-language speaker verification is also investigated. Experiments are conducted on the core task of the 2006 NIST Speaker Recognition Evaluation (SRE) corpus. The experimental results indicate that all the score normalization methods mentioned can improve the recognition performances and ATnorm behaves best. Moreover, ATnorrn can further contribute to the performance as a means of language robustness.

引用

页码：1121 / +

页数：3

共 50 条

[41] Text-independent speaker verification with dynamic trajectory model
Xiang, B
IEEE SIGNAL PROCESSING LETTERS, 2003, 10 (05) : 141 - 143
[42] FACTORED COVARIANCE MODELING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Wang, Eryu
Lee, Kong Aik
Ma, Bin
Li, Haizhou
Guo, Wu
Dai, Lirong
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4856 - 4859
[43] Mixup Learning Strategies for Text-independent Speaker Verification
Zhu, Yingke
Ko, Tom
Mak, Brian
INTERSPEECH 2019, 2019, : 4345 - 4349
[44] A CORRECTIVE LEARNING APPROACH FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Wen, Yandong
Zhou, Tianyan
Singh, Rita
Raj, Bhiksha
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4894 - 4898
[45] Group-based speaker embeddings for text-independent speaker verification
Jung, Youngmoon
Eom, Youngsik
Lee, Yeonghyeon
Kim, Hoirin
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 496 - 502
[46] Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification
Zhu, Yingke
Ko, Tom
Snyder, David
Mak, Brian
Povey, Daniel
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3573 - 3577
[47] Speaker adaptive cohort selection for Tnorm in text-independent speaker verification
Sturim, DE
Reynolds, DA
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 741 - 744
[48] Text-independent speaker identification utilizing likelihood normalization technique
Toyohashi Univ of Technology, Toyohashi-shi, Japan
IEICE Trans Inf Syst, 5 (585-593):
[49] Text-independent speaker identification utilizing likelihood normalization technique
Markov, KP
Nakagawa, S
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1997, E80D (05) : 585 - 593
[50] Significance of Constraining Text in Limited Data Text-independent Speaker Verification
Das, Rohan Kumar
Jelil, Sarfaraz
Prasanna, S. R. Mahadeva
2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,

← 1 2 3 4 5 →