Adaptive Individual Background Model for Speaker Verification

Cited by: 0

Authors:

Bar-Yosef, Yossi [1]

Bistritz, Yuval [1]

Affiliations:

[1] Tel Aviv Univ, Dept Elect Engn, IL-69978 Tel Aviv, Israel

Source:

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009

Keywords:

Model adaptation; Gaussian Mixture Models; Kullback-Leibler divergence; speaker verification; cohort selection; score normalization;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most current speaker verification techniques use Gaussian Mixture Models (GMMs) and make the decision by comparing the likelihood of the speaker model to the likelihood of a universal background model (UBM). The paper proposes replacing the UBM with an individual background model (IBM) that is generated for each speaker. The IBM is created from the K-nearest cohort models and the UBM by a simple new adaptation algorithm. The new GMM-IBM speaker verification system can also be combined with the various score normalization techniques that have been proposed to increase the robustness of the GMM-UBM system. Comparative experiments were conducted on the NIST-2004-SRE database with a plain system setting (without score normalization) and also in combination with adaptive test normalization (ATnorm). The results indicate that the proposed GMM-IBM system outperforms a comparable GMM-UBM system.
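The likelihood comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes diagonal-covariance GMMs stored as (weights, means, variances) tuples, and the function names gmm_log_likelihood and llr_score are hypothetical. The background model passed in could be a UBM or a per-speaker IBM; how the IBM itself is built from cohort models is not specified in this record and is not shown.

```python
import numpy as np


def gmm_log_likelihood(frames, weights, means, variances):
    """Average per-frame log-likelihood under a diagonal-covariance GMM.

    Assumed shapes: frames (T, D), weights (M,), means (M, D), variances (M, D).
    """
    frames = np.asarray(frames, dtype=float)
    weights = np.asarray(weights, dtype=float)
    means = np.asarray(means, dtype=float)
    variances = np.asarray(variances, dtype=float)

    diff = frames[:, None, :] - means[None, :, :]                      # (T, M, D)
    log_norm = -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)  # (M,)
    log_expo = -0.5 * np.sum(diff ** 2 / variances[None, :, :], axis=2)  # (T, M)
    log_comp = np.log(weights)[None, :] + log_norm[None, :] + log_expo   # (T, M)

    # Numerically stable log-sum-exp over the mixture components.
    max_log = np.max(log_comp, axis=1, keepdims=True)
    frame_ll = max_log[:, 0] + np.log(np.sum(np.exp(log_comp - max_log), axis=1))
    return float(np.mean(frame_ll))


def llr_score(frames, speaker_gmm, background_gmm):
    """Log-likelihood ratio: speaker model versus background model (UBM or IBM)."""
    return (gmm_log_likelihood(frames, *speaker_gmm)
            - gmm_log_likelihood(frames, *background_gmm))


# Toy example with one-dimensional, single-component models (illustrative values only).
speaker = ([1.0], [[0.0]], [[1.0]])
background = ([1.0], [[3.0]], [[1.0]])
score = llr_score([[0.0], [0.0]], speaker, background)
accept = score > 0.0  # the decision threshold is an assumption, tuned in practice
```

A claimed identity would be accepted when the score exceeds a threshold; score normalization methods such as the ATnorm mentioned in the abstract would be applied to this score before thresholding.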


Pages: 1279-1282

Page count: 4

50 entries in total

[1] Speaker Model Clustering to Construct Background Models for Speaker Verification
Disken, Gokay
Tufekci, Zekeriya
Cevik, Ulus
ARCHIVES OF ACOUSTICS, 2017, 42 (01) : 127 - 135
[2] A Study on Universal Background Model Training in Speaker Verification
Hasan, Taufiq
Hansen, John H. L.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07) : 1890 - 1899
[3] On the background model construction for speaker verification using GMM
Padrta, A
Radová, V
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 425 - 432
[4] Speaker verification without background speaker models
Hsu, CN
Yu, HC
Yang, BH
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003 : 233 - 236
[5] Background model design for flexible and portable speaker verification systems
Siohan, O
Lee, CH
Surendran, AC
Li, Q
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999 : 825 - 828
[6] Multiple Background Models for Speaker Verification
Zhang, Wei-Qiang
Shan, Yuxiang
Liu, Jia
ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010 : 47 - 51
[7] Speaker verification based on speaker background model virtually synthesized using local acoustic information
Isobe, T
Takahashi, J
Nakamura, T
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2002, 85 (04) : 47 - 57
[8] Speaker background models for connected digit password speaker verification
Rosenberg, AE
Parthasarathy, S
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996 : 81 - 84
[9] Emotional Adaptive Training for Speaker Verification
Bie, Fanhu
Wang, Dong
Zheng, Thomas Fang
Tejedor, Javier
Chen, Ruxin
2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013
[10] Adaptive Rectangle Loss for Speaker Verification
Li, Ruida
Fang, Shuo
Ma, Chenguang
Li, Liang
INTERSPEECH 2022, 2022 : 301 - 305
