DISCRIMINATIVE FEATURE TRANSFORMS USING DIFFERENCED MAXIMUM MUTUAL INFORMATION

被引：0

作者：

Delcroix, Marc ^{[1
]}

Ogawa, Atsunori ^{[1
]}

Watanabe, Shinji ^{[1
]}

Nakatani, Tomohiro ^{[1
]}

Nakamura, Atsushi ^{[1
]}

机构：

[1] NTT Corp, NTT Commun Sci Labs, Keihanna Sci City, Kyoto 6190237, Japan

来源：

2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012年

关键词：

Speech recognition; discriminative training; discriminative feature transforms; differenced MMI;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Recently feature compensation techniques that train feature transforms using a discriminative criterion have attracted much interest in the speech recognition community. Typically, the acoustic feature space is modeled by a Gaussian mixture model (GMM), and a feature transform is assigned to each Gaussian of the GMM. Feature compensation is then performed by transforming features using the transformation associated with each Gaussian, then summing up the transformed features weighted by the posterior probability of each Gaussian. Several discriminative criteria have been investigated for estimating the feature transformation parameters including maximum mutual information (MMI) and minimum phone error (MPE). Recently, the differenced MMI (dMMI) criterion that generalizes MMI and MPE, has been shown to provide competitive performance for acoustic model training. In this paper, we investigate the use of the dMMI criterion for discriminative feature transforms and demonstrate in a noisy speech recognition experiment that dMMI achieves recognition performance superior to that of MMI or MPE.

引用

页码：4753 / 4756

页数：4

共 50 条

[1] Nonlinear feature transforms using maximum mutual information
Torkkola, K
[J]. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 2756 - 2761
[2] UNSUPERVISED DISCRIMINATIVE ADAPTATION USING DIFFERENCED MAXIMUM MUTUAL INFORMATION BASED LINEAR REGRESSION
Delcroix, Marc
Ogawa, Atsunori
Hahm, Seong-Jun
Nakatani, Tomohiro
Nakamura, Atsushi
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7888 - 7892
[3] Maximally discriminative spectral feature projections using mutual information
Ozertem, U
Erdogmus, D
[J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 208 - 213
[4] Differenced maximum mutual information criterion for robust unsupervised acoustic model adaptation
Delcroix, Marc
Ogawa, Atsunori
Hahm, Seong-Jun
Nakatani, Tomohiro
Nakamura, Atsushi
[J]. COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 24 - 41
[5] Discriminative training of GMM based on Maximum Mutual Information for language identification
Qu Dan
Wang Bingxi
Yan Honggang
Dai Guannan
[J]. WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 1576 - +
[6] Mutual Information Regularized Feature-Level Frankenstein for Discriminative Recognition
Liu, Xiaofeng
Chao, Yang
You, Jane J.
Kuo, C-C Jay
Kumar, B. V. K. Vijaya
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5243 - 5260
[7] Using The Maximum Mutual Information Criterion To Textural Feature Selection For Satellite Image Classification
Kerroum, Mounir Ait
Hammouch, Ahmed
Aboutajdine, Driss
Bellaachia, Abdelghani
[J]. 2008 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1-3, 2008, : 584 - +
[8] Discriminative feature-space transforms using deep neural networks
Saon, George
Kingsbury, Brian
[J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 14 - 17
[9] Feature Selection Using Maximum Feature Tree Embedded with Mutual Information and Coefficient of Variation for Bird Sound Classification
Xu, Haifeng
Zhang, Yan
Liu, Jiang
Lv, Danjv
[J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
[10] How to train a discriminative front end with stochastic gradient descent and maximum mutual information
Droppo, J
Mahajan, M
Gunawardana, A
Acero, A
[J]. 2005 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2005, : 41 - 46

← 1 2 3 4 5 →