Discriminative training of GMM based on Maximum Mutual Information for language identification

被引：0

作者：

Qu Dan ^{[1
]}

Wang Bingxi ^{[1
]}

Yan Honggang ^{[1
]}

Dai Guannan ^{[1
]}

机构：

[1] Informat Engn Univ, Dept Signal Analyzing Engn, 837,POB 1001, Zhengzhou 450002, Peoples R China

来源：

WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS | 2006年

基金：

中国国家自然科学基金;

关键词：

Maximum Mutual Information(MMI); Gaussian Mixture Model(GMM); Generalized Probabilistic Descent (GPD); language identification;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, a discriminative training procedures based on Maximum Mutual Information(MMI) for a Gaussian Mixture Model (GMM) language identification system is described. The idea is to find the model parameters lambda that minimize the conditional entropy H-lambda (C vertical bar X) of the random variable C given the random variable X, which means minimize the uncertainty in knowing what language was spoken given access to the utterance in X. The implementation of the proposal is based on the Generalized Probabilistic Descent (GPD) algorithm formulated to estimate the GMM parameters. The evaluation is conducted using the OGI multi-language telephone speech corpus. The experimental results show such system is very effective in language identification tasks.

引用

页码：1576 / +

页数：2

共 50 条

[1] Two discriminative training schemes of GMM for language identification
Qu, D
Wang, BX
Zhang, Q
[J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 630 - 633
[2] Discriminative training of GMM for speaker identification
delAlamo, CM
Gil, FJC
Munilla, CDL
Gomez, LH
[J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 89 - 92
[3] Constrained Maximum Mutual Information Dimensionality Reduction for Language Identification
Huang, Shuai
Coppersmith, Glen A.
Karakos, Damianos
[J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2035 - 2038
[4] Discriminative training for speaker identification based on maximum model distance algorithm
Hong, QY
Kwong, S
[J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 25 - 28
[5] DISCRIMINATIVE FEATURE TRANSFORMS USING DIFFERENCED MAXIMUM MUTUAL INFORMATION
Delcroix, Marc
Ogawa, Atsunori
Watanabe, Shinji
Nakatani, Tomohiro
Nakamura, Atsushi
[J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4753 - 4756
[6] UNSUPERVISED DISCRIMINATIVE ADAPTATION USING DIFFERENCED MAXIMUM MUTUAL INFORMATION BASED LINEAR REGRESSION
Delcroix, Marc
Ogawa, Atsunori
Hahm, Seong-Jun
Nakatani, Tomohiro
Nakamura, Atsushi
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7888 - 7892
[7] Discriminative training techniques for acoustic language identification
Burget, Lukas
Matejka, Pavel
Cernocky, Jan
[J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 209 - 212
[8] On maximum mutual information speaker-adapted training
McDonough, John
Woelfel, Matthias
Stoimenov, Emilian
[J]. COMPUTER SPEECH AND LANGUAGE, 2008, 22 (02): : 130 - 147
[9] On maximum mutual information speaker-adapted training
McDonough, J
Schaaf, T
Waibel, A
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 601 - 604
[10] Acoustic Language Identification Using Fast Discriminative Training
Castaldo, Fabio
Colibro, Daniele
Dalmasso, Emanuele
Laface, Pietro
Vair, Claudio
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 389 - +

← 1 2 3 4 5 →