DISCRIMINATIVE FEATURE TRANSFORMS USING DIFFERENCED MAXIMUM MUTUAL INFORMATION

被引:0
|
作者
Delcroix, Marc [1 ]
Ogawa, Atsunori [1 ]
Watanabe, Shinji [1 ]
Nakatani, Tomohiro [1 ]
Nakamura, Atsushi [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Keihanna Sci City, Kyoto 6190237, Japan
关键词
Speech recognition; discriminative training; discriminative feature transforms; differenced MMI;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recently feature compensation techniques that train feature transforms using a discriminative criterion have attracted much interest in the speech recognition community. Typically, the acoustic feature space is modeled by a Gaussian mixture model (GMM), and a feature transform is assigned to each Gaussian of the GMM. Feature compensation is then performed by transforming features using the transformation associated with each Gaussian, then summing up the transformed features weighted by the posterior probability of each Gaussian. Several discriminative criteria have been investigated for estimating the feature transformation parameters including maximum mutual information (MMI) and minimum phone error (MPE). Recently, the differenced MMI (dMMI) criterion that generalizes MMI and MPE, has been shown to provide competitive performance for acoustic model training. In this paper, we investigate the use of the dMMI criterion for discriminative feature transforms and demonstrate in a noisy speech recognition experiment that dMMI achieves recognition performance superior to that of MMI or MPE.
引用
收藏
页码:4753 / 4756
页数:4
相关论文
共 50 条
  • [1] Nonlinear feature transforms using maximum mutual information
    Torkkola, K
    [J]. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 2756 - 2761
  • [2] UNSUPERVISED DISCRIMINATIVE ADAPTATION USING DIFFERENCED MAXIMUM MUTUAL INFORMATION BASED LINEAR REGRESSION
    Delcroix, Marc
    Ogawa, Atsunori
    Hahm, Seong-Jun
    Nakatani, Tomohiro
    Nakamura, Atsushi
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7888 - 7892
  • [3] Maximally discriminative spectral feature projections using mutual information
    Ozertem, U
    Erdogmus, D
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 208 - 213
  • [4] Differenced maximum mutual information criterion for robust unsupervised acoustic model adaptation
    Delcroix, Marc
    Ogawa, Atsunori
    Hahm, Seong-Jun
    Nakatani, Tomohiro
    Nakamura, Atsushi
    [J]. COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 24 - 41
  • [5] Discriminative training of GMM based on Maximum Mutual Information for language identification
    Qu Dan
    Wang Bingxi
    Yan Honggang
    Dai Guannan
    [J]. WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 1576 - +
  • [6] Mutual Information Regularized Feature-Level Frankenstein for Discriminative Recognition
    Liu, Xiaofeng
    Chao, Yang
    You, Jane J.
    Kuo, C-C Jay
    Kumar, B. V. K. Vijaya
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5243 - 5260
  • [7] Using The Maximum Mutual Information Criterion To Textural Feature Selection For Satellite Image Classification
    Kerroum, Mounir Ait
    Hammouch, Ahmed
    Aboutajdine, Driss
    Bellaachia, Abdelghani
    [J]. 2008 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1-3, 2008, : 584 - +
  • [8] Discriminative feature-space transforms using deep neural networks
    Saon, George
    Kingsbury, Brian
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 14 - 17
  • [9] Feature Selection Using Maximum Feature Tree Embedded with Mutual Information and Coefficient of Variation for Bird Sound Classification
    Xu, Haifeng
    Zhang, Yan
    Liu, Jiang
    Lv, Danjv
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [10] How to train a discriminative front end with stochastic gradient descent and maximum mutual information
    Droppo, J
    Mahajan, M
    Gunawardana, A
    Acero, A
    [J]. 2005 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2005, : 41 - 46