Real-time speaker identification and verification

被引:115
|
作者
Kinnunen, T [1 ]
Karpov, E [1 ]
Fränti, P [1 ]
机构
[1] Univ Joensuu, Dept Comp Sci, FIN-80101 Joensuu, Finland
关键词
Gaussian mixture model (GMM); pre-quantization; real-time; speaker pruning; speaker recognition; vector quantization (VQ);
D O I
10.1109/TSA.2005.853206
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In speaker identification, most of the computation originates from the distance or likelihood computations between the feature vectors of the unknown speaker and the models in the database. The identification time depends on the number of feature vectors, their dimensionality, the complexity of the speaker models and the number of speakers. In this paper, we concentrate on optimizing vector quantization (VQ) based speaker identification. We reduce the number of test vectors by pre-quantizing the test sequence prior to matching, and the number of speakers by pruning out unlikely speakers during the identification process. The best variants are then generalized to Gaussian mixture model (GMM) based modeling. We apply the algorithms also to efficient cohort set search for score normalization in speaker verification. We obtain a speed-up factor of 16:1 in the case of VQ-based modeling with minor degradation in the identification accuracy, and 34:1 in the case of GMM-based modeling. An equal error rate of 7% can be reached in 0.84 s on average when the length of test utterance is 30.4 s.
引用
收藏
页码:277 / 288
页数:12
相关论文
共 50 条
  • [1] Presentation of real-time system for automatic speaker identification and verification
    David, P
    [J]. 7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IV, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING, 2003, : 372 - 376
  • [2] REAL-TIME TECHNIQUE FOR SPEAKER VERIFICATION BY COMPUTER
    LUMMIS, RC
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 50 (01): : 106 - &
  • [3] Speaker pruning algorithm for real-time speaker identification
    Kinnunen, T
    Karpov, E
    Fränti, P
    [J]. AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 639 - 646
  • [4] Real-time speaker identification system
    Al-Shboul, Bashar
    Alsawalqah, Hamad
    Lee, Dongman
    [J]. PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED COMPUTER SCIENCE: COMPUTER SCIENCE CHALLENGES, 2007, : 422 - +
  • [5] Real-Time Speaker Identification Using Speaker Model Distance
    Zeinali, Hossein
    Sameti, Hossein
    Hadian, Hossein
    [J]. 2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 643 - 647
  • [6] REAL-TIME SPEAKER IDENTIFICATION FOR VIDEO CONFERENCING
    Saravi, S.
    Zafar, I.
    Edirisinghe, E. A.
    Kalawsky, R. S.
    [J]. REAL-TIME IMAGE AND VIDEO PROCESSING 2010, 2010, 7724
  • [7] Real-Time Speaker Verification System Implemented on Reconfigurable Hardware
    Ramos-Lara, Rafael
    Lopez-Garcia, Mariano
    Canto-Navarro, Enrique
    Puente-Rodriguez, Luis
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2013, 71 (02): : 89 - 103
  • [8] Real-Time Speaker Verification System Implemented on Reconfigurable Hardware
    Rafael Ramos-Lara
    Mariano López-García
    Enrique Cantó-Navarro
    Luís Puente-Rodriguez
    [J]. Journal of Signal Processing Systems, 2013, 71 : 89 - 103
  • [9] Unsupervised real-time speaker identification for daily movies
    Ying, L
    Kuo, CCJ
    [J]. INTERNET MULTIMEDIA MANAGEMENT SYSTEMS III, 2002, 4862 : 151 - 162
  • [10] Real-time Speaker Verification Based on GMM-UBM for PDA
    Chen, Yan
    Hong, Qingyang
    Chen, XiaoYang
    Zhang, Caihong
    [J]. SEC 2008: PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL SYMPOSIUM ON EMBEDDED COMPUTING, 2008, : 243 - 246