Adaptive Individual Background Model for Speaker Verification

被引:0
|
作者
Bar-Yosef, Yossi [1 ]
Bistritz, Yuval [1 ]
机构
[1] Tel Aviv Univ, Dept Elect Engn, IL-69978 Tel Aviv, Israel
关键词
Model adaptation; Gaussian Mixture Models; Kullback-Leibler divergence; speaker verification; cohort selection; score normalization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most techniques for speaker verification today use Gaussian Mixture Models (GMMs) and make the decision by comparing the likelihood of the speaker model to the likelihood of a universal background model (UBM). The paper proposes to replace the UBM by an individual background model (IBM) that is generated for each speaker. The IBM is created using the K-nearest cohort models and the UBM by a simple new adaptation algorithm. The new GMM-IBM speaker verification system can also be combined with various score normalization techniques that have been proposed to increase the robustness of the GMM-UBM system. Comparative experiments were held on the NIST-2004-SRE database with a plain system setting (without score normalization) and also with the combination of adaptive test normalization (ATnorm). Results indicated that the proposed GMM-IBM system outperforms a comparable GMM-UBM system.
引用
收藏
页码:1279 / 1282
页数:4
相关论文
共 50 条
  • [41] Comparison of model estimation techniques for speaker verification
    Carey, MJ
    Parris, ES
    Bennett, SJ
    LloydThomas, H
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1083 - 1086
  • [42] Speech Enhancement Regularized by a Speaker Verification Model
    Lay, Bunlong
    Gerkmann, Timo
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [43] Speaker Verification Using Gaussian Mixture Model
    Jagtap, Shilpa S.
    Bhalke, D. G.
    2015 INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING (ICPC), 2015,
  • [44] Optimal Impostor Model in Automatic Speaker Verification
    Djellali, Hayet
    Laskri, Mohamed Tayeb
    PROCEEDINGS OF 2012 INTERNATIONAL CONFERENCE ON COMPLEX SYSTEMS (ICCS12), 2012, : 545 - 550
  • [45] A Lightweight Speaker Verification Model For Edge Device
    Chen, Ting-Wei
    Chen, Chia-Ping
    Lu, Chung-Li
    Chan, Bo-Cheng
    Cheng, Yu-Han
    Chuang, Hsiang-Feng
    Chen, Wei-Yu
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1372 - 1377
  • [46] Method for adaptive training of polynomial networks with applications to speaker verification
    Campbell, WM
    Broun, CC
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1510 - 1515
  • [47] Speaker Verification Based on Channel Attention and Adaptive Joint Loss
    Fan, Houbin
    Li, Jun
    Ge, Fengpei
    Liang, Chunyan
    ELECTRONICS, 2025, 14 (03):
  • [48] PHONE ADAPTIVE TRAINING FOR SHORT-DURATION SPEAKER VERIFICATION
    Soldi, Giovanni
    Bozonnet, Simon
    Beaugeant, Christophe
    Evans, Nicholas
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2107 - 2111
  • [49] PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification
    Zheng, Siqi
    Suo, Hongbin
    Chen, Qian
    INTERSPEECH 2022, 2022, : 1431 - 1435