A tutorial on text-independent speaker verification

被引：445

作者：

Bimbot, F ^{[1
]}

Bonastre, JF

Fredouille, C

Gravier, G

Magrin-Chagnolleau, I

Meignier, S

Merlin, T

Ortega-García, J

Petrovska-Delacrétaz, D

Reynolds, DA

机构：

[1] IRISA, INRIA, F-35042 Rennes, France

[2] CNRS, F-35042 Rennes, France

[3] Univ Avignon, LIA, F-84911 Avignon 9, France

[4] CNRS, Lab Dynam Langage, F-69369 Lyon 07, France

[5] Univ Politecn Madrid, ATVS, E-28040 Madrid, Spain

[6] Univ Fribourg, Dept Informat, DIVA Lab, CH-1700 Fribourg, Switzerland

[7] MIT, Lincoln Lab, Cambridge, MA 02420 USA

来源：

EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING | 2004年 / 2004卷 / 04期

关键词：

speaker verification; text-independent; cepstral analysis; Gaussian mixture modeling;

D O I：

10.1155/S1110865704310024

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper presents an overview of a state-of-the-art text-independent speaker verification system. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the most commonly speech parameterization used in speaker verification, namely, cepstral analysis, is detailed. Gaussian mixture modeling, which is the speaker modeling technique used in most systems, is then explained. A few speaker modeling alternatives, namely, neural networks and support vector machines, are mentioned. Normalization of scores is then explained, as this is a very important step to deal with real-world data. The evaluation of a speaker verification system is then detailed, and the detection error trade-off (DET) curve is explained. Several extensions of speaker verification are then enumerated, including speaker tracking and segmentation by speakers. Then, some applications of speaker verification are proposed, including on-site applications, remote applications, applications relative to structuring audio information, and games. Issues concerning the forensic area are then recalled, as we believe it is very important to inform people about the actual performance and limitations of speaker verification systems. This paper concludes by giving a few research trends in speaker verification for the next couple of years.

引用

页码：430 / 451

页数：22

共 50 条

[1] A tutorial on text-independent speaker verification
Bimbot, F. (bimbot@irisa.fr), 1600, Hindawi Publishing Corporation (2004):
[2] A Tutorial on Text-Independent Speaker Verification
Frédéric Bimbot
Jean-François Bonastre
Corinne Fredouille
Guillaume Gravier
Ivan Magrin-Chagnolleau
Sylvain Meignier
Teva Merlin
Javier Ortega-García
Dijana Petrovska-Delacrétaz
Douglas A. Reynolds
EURASIP Journal on Advances in Signal Processing, 2004
[3] Graphical models for text-independent speaker verification
Sánchez-Soto, E
Sigelle, M
Chollet, G
NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 410 - 415
[4] Language dependency in text-independent speaker verification
Auckenthaler, R
Carey, MJ
Mason, JSD
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 441 - 444
[5] Text-independent speaker verification in embedded environments
Tydlitat, Borivoj
Navratil, Jiri
Pelecanos, Jason W.
Ramaswamy, Ganesh N.
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 293 - +
[6] ORTHOGONAL TRAINING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Zhu, Yingke
Mak, Brian
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6584 - 6588
[7] Adaptive method for text-independent speaker verification
Zhang, Yiying, 2000, (11):
[8] Deeply Fused Speaker Embeddings for Text-Independent Speaker Verification
Bhattacharya, Gautam
Alam, Jahangir
Gupta, Vishwa
Kenny, Patrick
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3588 - 3592
[9] Deep Speaker Feature Learning for Text-independent Speaker Verification
Li, Lantian
Chen, Yixiang
Shi, Zing
Tang, Zhiyuan
Wang, Dong
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1542 - 1546
[10] A Survey on Text-Dependent and Text-Independent Speaker Verification
Tu, Youzhi
Lin, Weiwei
Mak, Man-Wai
IEEE ACCESS, 2022, 10 : 99038 - 99049

← 1 2 3 4 5 →