Real-time speaker identification and verification

Cited by: 115
Authors
Kinnunen, T [1]
Karpov, E [1]
Fränti, P [1]
Affiliations
[1] Univ Joensuu, Dept Comp Sci, FIN-80101 Joensuu, Finland
Keywords
Gaussian mixture model (GMM); pre-quantization; real-time; speaker pruning; speaker recognition; vector quantization (VQ)
DOI
10.1109/TSA.2005.853206
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
In speaker identification, most of the computation originates from the distance or likelihood computations between the feature vectors of the unknown speaker and the models in the database. The identification time depends on the number of feature vectors, their dimensionality, the complexity of the speaker models, and the number of speakers. In this paper, we concentrate on optimizing vector quantization (VQ) based speaker identification. We reduce the number of test vectors by pre-quantizing the test sequence prior to matching, and the number of speakers by pruning out unlikely speakers during the identification process. The best variants are then generalized to Gaussian mixture model (GMM) based modeling. We also apply the algorithms to efficient cohort set search for score normalization in speaker verification. We obtain a speed-up factor of 16:1 in the case of VQ-based modeling with minor degradation in the identification accuracy, and 34:1 in the case of GMM-based modeling. An equal error rate of 7% can be reached in 0.84 s on average when the length of the test utterance is 30.4 s.
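The two speed-ups named in the abstract, pre-quantizing the test sequence and pruning unlikely speakers during matching, can be illustrated with a small sketch. This is not the authors' implementation. It assumes block averaging as the pre-quantizer and a fixed "drop the worst speaker after each block" pruning rule, and every name in it (`prequantize`, `identify`, `codebooks`, the toy speakers and vectors) is hypothetical.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def prequantize(vectors, group):
    """Assumed pre-quantizer: replace each run of `group` vectors by its mean."""
    reduced = []
    for i in range(0, len(vectors), group):
        chunk = vectors[i:i + group]
        dim = len(chunk[0])
        reduced.append([sum(v[d] for v in chunk) / len(chunk) for d in range(dim)])
    return reduced


def identify(vectors, codebooks, block=2, drop=1):
    """Return the speaker with the lowest accumulated VQ distortion.

    Assumed pruning rule: after every `block` test vectors, the `drop`
    worst-scoring speakers are removed from the active set.
    """
    active = {name: 0.0 for name in codebooks}
    for i, v in enumerate(vectors, start=1):
        for name in active:
            # Distortion of v is its distance to the nearest code vector.
            active[name] += min(sq_dist(v, c) for c in codebooks[name])
        if i % block == 0 and len(active) > 1:
            ranked = sorted(active, key=active.get)
            keep = max(1, len(ranked) - drop)
            active = {n: active[n] for n in ranked[:keep]}
    return min(active, key=active.get)


# Toy data: three speakers with two code vectors each (made up for illustration).
codebooks = {
    "alice": [[0.0, 0.0], [1.0, 1.0]],
    "bob": [[5.0, 5.0], [6.0, 6.0]],
    "carol": [[10.0, 0.0], [11.0, 1.0]],
}
test = [[0.1, 0.0], [0.9, 1.1], [0.0, 0.2], [1.0, 0.9], [0.2, 0.1], [1.1, 1.0]]

reduced = prequantize(test, 2)  # 6 test vectors become 3
print(identify(reduced, codebooks, block=1))
```

Pre-quantization reduces the number of distance computations per speaker, while pruning reduces the number of speakers that each remaining vector is matched against, so the two savings multiply. The abstract does not specify the variants or parameters used, so the choices above are placeholders only.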
Pages: 277-288
Number of pages: 12
Related papers
50 in total
  • [21] Real-time unsupervised speaker change detection
    Lu, L
    Zhang, HJ
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 358 - 361
  • [22] Application for Real-time Personalized Speaker Extraction
    Ronssin, Damien
    Cernak, Milos
    [J]. INTERSPEECH 2022, 2022, : 1955 - 1956
  • [23] TOWARDS REAL-TIME AUDIOVISUAL SPEAKER LOCALIZATION
    Monaci, Gianluca
    [J]. 19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1055 - 1059
  • [24] Mobile intelligent terminal speaker identification for real-time monitoring system of sports training
    Yue, Yibo
    Yang, Yucheng
    [J]. EVOLUTIONARY INTELLIGENCE, 2023, 16 (06) : 1801 - 1812
  • [26] A real time speaker verification demonstration on the smart flow system
    Xu, R
    Mei, G
    Ren, Z
    Kwan, C
    Stanford, V
    Aube, J
    Rochet, C
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 226 - 229
  • [27] REAL-TIME FINGERPRINT VERIFICATION SYSTEM
    GAMBLE, FT
    FRYE, LM
    GRIESER, DR
    [J]. APPLIED OPTICS, 1992, 31 (05): : 652 - 655
  • [28] Face verification for real-time applications
    Romano, R
    Beymer, D
    Poggio, T
    [J]. IMAGE UNDERSTANDING WORKSHOP, 1996 PROCEEDINGS, VOLS I AND II, 1996, : 747 - 756
  • [29] An abstraction technique for real-time verification
    Clarke, Edmund M.
    Lerda, Flavio
    Talupur, Muralidhar
    [J]. NEXT GENERATION DESIGN AND VERIFICATION METHODOLOGIES FOR DISTRIBUTED EMBEDDED CONTROL SYSTEMS, 2007, : 1 - +
  • [30] TEMPORAL VERIFICATION OF REAL-TIME SYSTEMS
    CAMPOS, SV
    CLARKE, EM
    MARRERO, W
    MINEA, M
    HIRAISHI, H
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (07) : 796 - 801