Adaptive Individual Background Model for Speaker Verification

被引：0

作者：

Bar-Yosef, Yossi ^{[1
]}

Bistritz, Yuval ^{[1
]}

机构：

[1] Tel Aviv Univ, Dept Elect Engn, IL-69978 Tel Aviv, Israel

来源：

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年

关键词：

Model adaptation; Gaussian Mixture Models; Kullback-Leibler divergence; speaker verification; cohort selection; score normalization;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most techniques for speaker verification today use Gaussian Mixture Models (GMMs) and make the decision by comparing the likelihood of the speaker model to the likelihood of a universal background model (UBM). The paper proposes to replace the UBM by an individual background model (IBM) that is generated for each speaker. The IBM is created using the K-nearest cohort models and the UBM by a simple new adaptation algorithm. The new GMM-IBM speaker verification system can also be combined with various score normalization techniques that have been proposed to increase the robustness of the GMM-UBM system. Comparative experiments were held on the NIST-2004-SRE database with a plain system setting (without score normalization) and also with the combination of adaptive test normalization (ATnorm). Results indicated that the proposed GMM-IBM system outperforms a comparable GMM-UBM system.

引用

页码：1279 / 1282

页数：4

共 50 条

[41] Comparison of model estimation techniques for speaker verification
Carey, MJ
Parris, ES
Bennett, SJ
LloydThomas, H
1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1083 - 1086
[42] Speech Enhancement Regularized by a Speaker Verification Model
Lay, Bunlong
Gerkmann, Timo
2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
[43] Speaker Verification Using Gaussian Mixture Model
Jagtap, Shilpa S.
Bhalke, D. G.
2015 INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING (ICPC), 2015,
[44] Optimal Impostor Model in Automatic Speaker Verification
Djellali, Hayet
Laskri, Mohamed Tayeb
PROCEEDINGS OF 2012 INTERNATIONAL CONFERENCE ON COMPLEX SYSTEMS (ICCS12), 2012, : 545 - 550
[45] A Lightweight Speaker Verification Model For Edge Device
Chen, Ting-Wei
Chen, Chia-Ping
Lu, Chung-Li
Chan, Bo-Cheng
Cheng, Yu-Han
Chuang, Hsiang-Feng
Chen, Wei-Yu
2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1372 - 1377
[46] Method for adaptive training of polynomial networks with applications to speaker verification
Campbell, WM
Broun, CC
IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1510 - 1515
[47] Speaker Verification Based on Channel Attention and Adaptive Joint Loss
Fan, Houbin
Li, Jun
Ge, Fengpei
Liang, Chunyan
ELECTRONICS, 2025, 14 (03):
[48] PHONE ADAPTIVE TRAINING FOR SHORT-DURATION SPEAKER VERIFICATION
Soldi, Giovanni
Bozonnet, Simon
Beaugeant, Christophe
Evans, Nicholas
2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2107 - 2111
[49] PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification
Zheng, Siqi
Suo, Hongbin
Chen, Qian
INTERSPEECH 2022, 2022, : 1431 - 1435
[50] Cross similarity measurement for speaker adaptive test normalization in text-independent speaker verification
ZHAO Jian
The Journal of China Universities of Posts and Telecommunications, 2008, (02) : 130 - 134

← 1 2 3 4 5 →