Discriminative training of GMM based on Maximum Mutual Information for language identification

Cited by: 0
Authors
Qu Dan [1]
Wang Bingxi [1]
Yan Honggang [1]
Dai Guannan [1]
Affiliations
[1] Informat Engn Univ, Dept Signal Analyzing Engn, 837,POB 1001, Zhengzhou 450002, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Maximum Mutual Information (MMI); Gaussian Mixture Model (GMM); Generalized Probabilistic Descent (GPD); language identification
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper describes a discriminative training procedure based on Maximum Mutual Information (MMI) for a Gaussian Mixture Model (GMM) language identification system. The idea is to find the model parameters $\lambda$ that minimize the conditional entropy $H_\lambda(C \mid X)$ of the random variable $C$ given the random variable $X$, that is, to minimize the uncertainty about which language was spoken given the utterance $X$. The implementation is based on the Generalized Probabilistic Descent (GPD) algorithm, formulated to estimate the GMM parameters. The evaluation is conducted on the OGI multi-language telephone speech corpus. The experimental results show that such a system is very effective in language identification tasks.
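The objective named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes scalar features, uniform language priors, a constant step size, and per-utterance gradient updates of the component means only (weights and variances stay fixed). All function names and the toy data are hypothetical. The conditional entropy is estimated as the average of -log P(c | X) over labelled utterances, and each update descends the gradient of that per-utterance loss.

```python
import math
import random

def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))

def component_log_probs(x, gmm):
    """log(w_m * N(x; mu_m, var_m)) for one scalar frame x and each component m."""
    return [math.log(w) - 0.5 * (math.log(2 * math.pi * var) + (x - mu) ** 2 / var)
            for w, mu, var in gmm]

def utterance_log_likelihood(frames, gmm):
    """log p(X | model), treating the frames of X as independent."""
    return sum(logsumexp(component_log_probs(x, gmm)) for x in frames)

def class_log_posteriors(frames, gmms, log_priors):
    """log P(c | X) for every language model c."""
    joint = [utterance_log_likelihood(frames, g) + lp for g, lp in zip(gmms, log_priors)]
    norm = logsumexp(joint)
    return [j - norm for j in joint]

def conditional_entropy(utterances, labels, gmms, log_priors):
    """Empirical estimate of H(C | X): mean of -log P(c_n | X_n)."""
    losses = [-class_log_posteriors(x, gmms, log_priors)[c]
              for x, c in zip(utterances, labels)]
    return sum(losses) / len(losses)

def gpd_step_means(frames, label, gmms, log_priors, step):
    """One per-utterance gradient step on the means (simplifying assumption)."""
    post = [math.exp(lp) for lp in class_log_posteriors(frames, gmms, log_priors)]
    for k, gmm in enumerate(gmms):
        # d log p(X | model k) / d mu_m, accumulated over frames
        dloglik = [0.0] * len(gmm)
        for x in frames:
            comp = component_log_probs(x, gmm)
            norm = logsumexp(comp)
            for m, (_, mu, var) in enumerate(gmm):
                dloglik[m] += math.exp(comp[m] - norm) * (x - mu) / var
        # Gradient of -log P(label | X) w.r.t. mu_m is -scale * dloglik[m],
        # so a descent step adds step * scale * dloglik[m].
        scale = (1.0 if k == label else 0.0) - post[k]
        gmms[k] = [(w, mu + step * scale * d, var)
                   for (w, mu, var), d in zip(gmm, dloglik)]

def demo(seed=0, epochs=5, step=0.005):
    """Toy data: two 'languages' whose frames are centred at 0.0 and 2.0."""
    rng = random.Random(seed)
    centers = [0.0, 2.0]
    labels = [n % 2 for n in range(40)]
    utterances = [[rng.gauss(centers[c], 1.0) for _ in range(20)] for c in labels]
    # Both models start identical, so the initial posteriors are uniform.
    gmms = [[(0.5, 0.5, 1.0), (0.5, 1.5, 1.0)] for _ in centers]
    log_priors = [math.log(0.5)] * 2
    before = conditional_entropy(utterances, labels, gmms, log_priors)
    for _ in range(epochs):
        for x, c in zip(utterances, labels):
            gpd_step_means(x, c, gmms, log_priors, step)
    after = conditional_entropy(utterances, labels, gmms, log_priors)
    return before, after
```

Calling `demo()` returns the estimated conditional entropy before and after training. Because both toy models start identical, the starting value equals log 2, and the updates pull each model toward its own language's data and away from the other's.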
Pages: 1576 / +
Page count: 2
Related papers
50 in total
  • [1] Two discriminative training schemes of GMM for language identification
    Qu, D
    Wang, BX
    Zhang, Q
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004: 630-633
  • [2] Discriminative training of GMM for speaker identification
    delAlamo, CM
    Gil, FJC
    Munilla, CDL
    Gomez, LH
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996: 89-92
  • [3] Constrained Maximum Mutual Information Dimensionality Reduction for Language Identification
    Huang, Shuai
    Coppersmith, Glen A.
    Karakos, Damianos
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012: 2035-2038
  • [4] Discriminative training for speaker identification based on maximum model distance algorithm
    Hong, QY
    Kwong, S
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004: 25-28
  • [5] DISCRIMINATIVE FEATURE TRANSFORMS USING DIFFERENCED MAXIMUM MUTUAL INFORMATION
    Delcroix, Marc
    Ogawa, Atsunori
    Watanabe, Shinji
    Nakatani, Tomohiro
    Nakamura, Atsushi
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012: 4753-4756
  • [6] UNSUPERVISED DISCRIMINATIVE ADAPTATION USING DIFFERENCED MAXIMUM MUTUAL INFORMATION BASED LINEAR REGRESSION
    Delcroix, Marc
    Ogawa, Atsunori
    Hahm, Seong-Jun
    Nakatani, Tomohiro
    Nakamura, Atsushi
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013: 7888-7892
  • [7] Discriminative training techniques for acoustic language identification
    Burget, Lukas
    Matejka, Pavel
    Cernocky, Jan
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006: 209-212
  • [8] On maximum mutual information speaker-adapted training
    McDonough, John
    Woelfel, Matthias
    Stoimenov, Emilian
    [J]. COMPUTER SPEECH AND LANGUAGE, 2008, 22(02): 130-147
  • [9] On maximum mutual information speaker-adapted training
    McDonough, J
    Schaaf, T
    Waibel, A
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002: 601-604
  • [10] Acoustic Language Identification Using Fast Discriminative Training
    Castaldo, Fabio
    Colibro, Daniele
    Dalmasso, Emanuele
    Laface, Pietro
    Vair, Claudio
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007: 389-+