A tutorial on text-independent speaker verification

Cited by: 445

Authors:

Bimbot, F [1]

Bonastre, JF

Fredouille, C

Gravier, G

Magrin-Chagnolleau, I

Meignier, S

Merlin, T

Ortega-García, J

Petrovska-Delacrétaz, D

Reynolds, DA

Affiliations:

[1] IRISA, INRIA, F-35042 Rennes, France

[2] CNRS, F-35042 Rennes, France

[3] Univ Avignon, LIA, F-84911 Avignon 9, France

[4] CNRS, Lab Dynam Langage, F-69369 Lyon 07, France

[5] Univ Politecn Madrid, ATVS, E-28040 Madrid, Spain

[6] Univ Fribourg, Dept Informat, DIVA Lab, CH-1700 Fribourg, Switzerland

[7] MIT, Lincoln Lab, Cambridge, MA 02420 USA

Source:

EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING | Year 2004 / Volume 2004 / Issue 04

Keywords:

speaker verification; text-independent; cepstral analysis; Gaussian mixture modeling

DOI:

10.1155/S1110865704310024

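Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]

Discipline classification code:

0808; 0809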

Abstract:

This paper presents an overview of a state-of-the-art text-independent speaker verification system. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the most commonly used speech parameterization in speaker verification, namely, cepstral analysis, is detailed. Gaussian mixture modeling, which is the speaker modeling technique used in most systems, is then explained. A few speaker modeling alternatives, namely, neural networks and support vector machines, are mentioned. Normalization of scores is then explained, as this is a very important step to deal with real-world data. The evaluation of a speaker verification system is then detailed, and the detection error trade-off (DET) curve is explained. Several extensions of speaker verification are then enumerated, including speaker tracking and segmentation by speakers. Then, some applications of speaker verification are proposed, including on-site applications, remote applications, applications relative to structuring audio information, and games. Issues concerning the forensic area are then recalled, as we believe it is very important to inform people about the actual performance and limitations of speaker verification systems. This paper concludes by giving a few research trends in speaker verification for the next couple of years.

Pages: 430 - 451

Page count: 22

50 items in total

[31] Mixup Learning Strategies for Text-independent Speaker Verification
Zhu, Yingke
Ko, Tom
Mak, Brian
INTERSPEECH 2019, 2019, : 4345 - 4349
[32] A CORRECTIVE LEARNING APPROACH FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Wen, Yandong
Zhou, Tianyan
Singh, Rita
Raj, Bhiksha
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4894 - 4898
[33] Group-based speaker embeddings for text-independent speaker verification
Jung, Youngmoon
Eom, Youngsik
Lee, Yeonghyeon
Kim, Hoirin
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 496 - 502
[34] Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification
Zhu, Yingke
Ko, Tom
Snyder, David
Mak, Brian
Povey, Daniel
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3573 - 3577
[35] Speaker adaptive cohort selection for Tnorm in text-independent speaker verification
Sturim, DE
Reynolds, DA
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 741 - 744
[36] Significance of Constraining Text in Limited Data Text-independent Speaker Verification
Das, Rohan Kumar
Jelil, Sarfaraz
Prasanna, S. R. Mahadeva
2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
[37] GENERATIVE X-VECTORS FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Xu, Longting
Das, Rohan Kumar
Yilmaz, Emre
Yang, Jichen
Li, Haizhou
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 1014 - 1020
[38] Discriminative transformation for sufficient adaptation in text-independent speaker verification
Yang, Hao
Dong, Yuan
Zhao, Xianyu
Zha, Jian
Wang, Haila
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 558 - +
[39] TEXT-INDEPENDENT SPEAKER VERIFICATION WITH ADVERSARIAL LEARNING ON SHORT UTTERANCES
Liu, Kai
Zhou, Huan
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6569 - 6573
[40] GRAPH ATTENTIVE FEATURE AGGREGATION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Shim, Hye-Jin
Heo, Jungwoo
Park, Jae-Han
Lee, Ga-Hui
Yu, Ha-Jin
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7972 - 7976