Real-time speaker identification and verification

被引：115

作者：

Kinnunen, T ^{[1
]}

Karpov, E ^{[1
]}

Fränti, P ^{[1
]}

机构：

[1] Univ Joensuu, Dept Comp Sci, FIN-80101 Joensuu, Finland

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2006年 / 14卷 / 01期

关键词：

Gaussian mixture model (GMM); pre-quantization; real-time; speaker pruning; speaker recognition; vector quantization (VQ);

D O I：

10.1109/TSA.2005.853206

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In speaker identification, most of the computation originates from the distance or likelihood computations between the feature vectors of the unknown speaker and the models in the database. The identification time depends on the number of feature vectors, their dimensionality, the complexity of the speaker models and the number of speakers. In this paper, we concentrate on optimizing vector quantization (VQ) based speaker identification. We reduce the number of test vectors by pre-quantizing the test sequence prior to matching, and the number of speakers by pruning out unlikely speakers during the identification process. The best variants are then generalized to Gaussian mixture model (GMM) based modeling. We apply the algorithms also to efficient cohort set search for score normalization in speaker verification. We obtain a speed-up factor of 16:1 in the case of VQ-based modeling with minor degradation in the identification accuracy, and 34:1 in the case of GMM-based modeling. An equal error rate of 7% can be reached in 0.84 s on average when the length of test utterance is 30.4 s.

引用

页码：277 / 288

页数：12

共 50 条

[1] Presentation of real-time system for automatic speaker identification and verification
David, P
[J]. 7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IV, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING, 2003, : 372 - 376
[2] REAL-TIME TECHNIQUE FOR SPEAKER VERIFICATION BY COMPUTER
LUMMIS, RC
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 50 (01): : 106 - &
[3] Speaker pruning algorithm for real-time speaker identification
Kinnunen, T
Karpov, E
Fränti, P
[J]. AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 639 - 646
[4] Real-time speaker identification system
Al-Shboul, Bashar
Alsawalqah, Hamad
Lee, Dongman
[J]. PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED COMPUTER SCIENCE: COMPUTER SCIENCE CHALLENGES, 2007, : 422 - +
[5] Real-Time Speaker Identification Using Speaker Model Distance
Zeinali, Hossein
Sameti, Hossein
Hadian, Hossein
[J]. 2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 643 - 647
[6] REAL-TIME SPEAKER IDENTIFICATION FOR VIDEO CONFERENCING
Saravi, S.
Zafar, I.
Edirisinghe, E. A.
Kalawsky, R. S.
[J]. REAL-TIME IMAGE AND VIDEO PROCESSING 2010, 2010, 7724
[7] Real-Time Speaker Verification System Implemented on Reconfigurable Hardware
Ramos-Lara, Rafael
Lopez-Garcia, Mariano
Canto-Navarro, Enrique
Puente-Rodriguez, Luis
[J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2013, 71 (02): : 89 - 103
[8] Real-Time Speaker Verification System Implemented on Reconfigurable Hardware
Rafael Ramos-Lara
Mariano López-García
Enrique Cantó-Navarro
Luís Puente-Rodriguez
[J]. Journal of Signal Processing Systems, 2013, 71 : 89 - 103
[9] Unsupervised real-time speaker identification for daily movies
Ying, L
Kuo, CCJ
[J]. INTERNET MULTIMEDIA MANAGEMENT SYSTEMS III, 2002, 4862 : 151 - 162
[10] Real-time Speaker Verification Based on GMM-UBM for PDA
Chen, Yan
Hong, Qingyang
Chen, XiaoYang
Zhang, Caihong
[J]. SEC 2008: PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL SYMPOSIUM ON EMBEDDED COMPUTING, 2008, : 243 - 246

← 1 2 3 4 5 →