Adaptive Individual Background Model for Speaker Verification

被引:0
|
作者
Bar-Yosef, Yossi [1 ]
Bistritz, Yuval [1 ]
机构
[1] Tel Aviv Univ, Dept Elect Engn, IL-69978 Tel Aviv, Israel
关键词
Model adaptation; Gaussian Mixture Models; Kullback-Leibler divergence; speaker verification; cohort selection; score normalization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most techniques for speaker verification today use Gaussian Mixture Models (GMMs) and make the decision by comparing the likelihood of the speaker model to the likelihood of a universal background model (UBM). The paper proposes to replace the UBM by an individual background model (IBM) that is generated for each speaker. The IBM is created using the K-nearest cohort models and the UBM by a simple new adaptation algorithm. The new GMM-IBM speaker verification system can also be combined with various score normalization techniques that have been proposed to increase the robustness of the GMM-UBM system. Comparative experiments were held on the NIST-2004-SRE database with a plain system setting (without score normalization) and also with the combination of adaptive test normalization (ATnorm). Results indicated that the proposed GMM-IBM system outperforms a comparable GMM-UBM system.
引用
收藏
页码:1279 / 1282
页数:4
相关论文
共 50 条
  • [1] Speaker Model Clustering to Construct Background Models for Speaker Verification
    Disken, Gokay
    Tufekci, Zekeriya
    Cevik, Ulus
    ARCHIVES OF ACOUSTICS, 2017, 42 (01) : 127 - 135
  • [2] A Study on Universal Background Model Training in Speaker Verification
    Hasan, Taufiq
    Hansen, John H. L.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 1890 - 1899
  • [3] On the background model construction for speaker verification using GMM
    Padrta, A
    Radová, V
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 425 - 432
  • [4] Speaker verification without background speaker models
    Hsu, CN
    Yu, HC
    Yang, BH
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 233 - 236
  • [5] Background model design for flexible and portable speaker verification systems
    Siohan, O
    Lee, CH
    Surendran, AC
    Li, Q
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 825 - 828
  • [6] Multiple Background Models for Speaker Verification
    Zhang, Wei-Qiang
    Shan, Yuxiang
    Liu, Jia
    ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 47 - 51
  • [7] Speaker verification based on speaker background model virtually synthesized using local acoustic information
    Isobe, T
    Takahashi, J
    Nakamura, T
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2002, 85 (04): : 47 - 57
  • [8] Speaker background models for connected digit password speaker verification
    Rosenberg, AE
    Parthasarathy, S
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 81 - 84
  • [9] Emotional Adaptive Training for Speaker Verification
    Bie, Fanhu
    Wang, Dong
    Zheng, Thomas Fang
    Tejedor, Javier
    Chen, Ruxin
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [10] Adaptive Rectangle Loss for Speaker Verification
    Li, Ruida
    Fang, Shuo
    Ma, Chenguang
    Li, Liang
    INTERSPEECH 2022, 2022, : 301 - 305