Gaussian Mixture Optimization Based on Efficient Cross-Validation

被引:6
|
作者
Shinozaki, Takahiro [1 ]
Furui, Sadaoki [1 ]
Kawahara, Tatsuya [2 ]
机构
[1] Tokyo Inst Technol, Dept Comp Sci, Tokyo 1528552, Japan
[2] Kyoto Univ, Acad Ctr Comp & Media Studies, Kyoto 6068501, Japan
关键词
Cross-validation; Gaussian mixture; hidden Markov model (HMM); speech recognition; sufficient statistics;
D O I
10.1109/JSTSP.2010.2048235
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A Gaussian mixture optimization method is developed by using the cross-validation (CV) likelihood as an objective function instead of the conventional training set likelihood. The optimization is based on reducing the number of mixture components by selecting and merging pairs of Gaussians step by step according to the objective function so as to remove redundant components and improve the generality of the model. The CV likelihood is more effective for avoiding over-fitting than is the conventional likelihood, and it provides a termination criterion that does not rely on empirical thresholds. While the idea is simple, one problem is its infeasible computational cost. To make such optimization practical, an efficient evaluation algorithm using sufficient statistics is proposed. In addition, aggregated CV (AgCV) is developed to further improve the generalization performance of CV. Large-vocabulary speech recognition experiments on oral presentations show that the proposed methods improve speech recognition performance with automatically determined model complexity. The AgCV-based optimization is computationally more expensive than the CV-based method but gives better recognition performance.
引用
收藏
页码:540 / 547
页数:8
相关论文
共 50 条
  • [1] Gaussian Mixture Optimization for HMM based on Efficient Cross-validation
    Shinozaki, Takahiro
    Kawahara, Tatsuya
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 653 - 656
  • [2] Aggregated Cross-validation and Its Efficient Application to Gaussian Mixture Optimization
    Shinozaki, Takahiro
    Furui, Sadaoki
    Kawahara, Tatsuya
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2382 - +
  • [3] Robust Estimation of Free Energy Landscapes from Gaussian Mixture Models with Cross-Validation
    Delemotte, Lucie
    Westerlund, Annie M.
    Blau, Christian
    BIOPHYSICAL JOURNAL, 2019, 116 (03) : 303A - 303A
  • [4] HMM state clustering based on efficient cross-validation
    Shinozaki, T.
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1157 - 1160
  • [5] Efficient cross-validation of principal components
    Krzanowski, WJ
    STATISTICS AND COMPUTING, 1996, 6 (02) : 177 - 177
  • [6] Probabilistic Cross-Validation Estimators for Gaussian Process Regression
    Martino, Luca
    Laparra, Valero
    Camps-Valls, Gustau
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 823 - 827
  • [7] Cross-validation method for bivariate measure with certain mixture
    Sabre, Rachid
    4TH INTERNATIONAL CONFERENCE ON SCIENCE & ENGINEERING IN MATHEMATICS, CHEMISTRY AND PHYSICS 2016 (SCIETECH 2016), 2016, 710
  • [8] Cross-validation is dead. Long live cross-validation! Model validation based on resampling
    Knut Baumann
    Journal of Cheminformatics, 2 (Suppl 1)
  • [9] Efficient algorithms for decision tree cross-validation
    Blockeel, H
    Struyf, J
    JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 621 - 650
  • [10] Efficient Cross-Validation of Echo State Networks
    Lukosevicius, Mantas
    Uselis, Arnas
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 121 - 133