Gaussian Mixture Optimization for HMM based on Efficient Cross-validation

被引:0
|
作者
Shinozaki, Takahiro [1 ]
Kawahara, Tatsuya [1 ]
机构
[1] Kyoto Univ, Acad Ctr Comp & Media Studies, Kyoto, Japan
关键词
speech recognition; HMM; Gaussian mixture; cross-validation; sufficient statistics;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A Gaussian mixture optimization method is explored using cross-validation likelihood as an objective function instead of the conventional training set likelihood. The optimization is based on reducing the number of mixture components by selecting and merging a pair of Gaussians step by step base on the objective function so as to remove redundant components and improve the generality of the model. Cross-validation likelihood is more appropriate for avoiding over-fitting than the conventional likelihood and can be efficiently computed using sufficient statistics. It results in a better Gaussian pair selection and provides a termination criterion that does not rely on empirical thresholds. Large-vocabulary speech recognition experiments on oral presentations show that the cross-validation method gives a smaller word error rate with an automatically determined model size than a baseline training procedure that does not perform the optimization.
引用
收藏
页码:653 / 656
页数:4
相关论文
共 50 条
  • [1] Gaussian Mixture Optimization Based on Efficient Cross-Validation
    Shinozaki, Takahiro
    Furui, Sadaoki
    Kawahara, Tatsuya
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (03) : 540 - 547
  • [2] Aggregated Cross-validation and Its Efficient Application to Gaussian Mixture Optimization
    Shinozaki, Takahiro
    Furui, Sadaoki
    Kawahara, Tatsuya
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2382 - +
  • [3] HMM state clustering based on efficient cross-validation
    Shinozaki, T.
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1157 - 1160
  • [4] CROSS-VALIDATION BASED DECISION TREE CLUSTERING FOR HMM-BASED TTS
    Zhang, Yu
    Yan, Zhi-Jie
    Soong, Frank K.
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4602 - 4605
  • [5] Robust Estimation of Free Energy Landscapes from Gaussian Mixture Models with Cross-Validation
    Delemotte, Lucie
    Westerlund, Annie M.
    Blau, Christian
    BIOPHYSICAL JOURNAL, 2019, 116 (03) : 303A - 303A
  • [6] HMM training based on CV-EM and CV Gaussian mixture optimization
    Shinozaki, Takahiro
    Kawahara, Tatsuya
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 318 - 322
  • [7] Efficient cross-validation of principal components
    Krzanowski, WJ
    STATISTICS AND COMPUTING, 1996, 6 (02) : 177 - 177
  • [8] Probabilistic Cross-Validation Estimators for Gaussian Process Regression
    Martino, Luca
    Laparra, Valero
    Camps-Valls, Gustau
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 823 - 827
  • [9] Cross-validation method for bivariate measure with certain mixture
    Sabre, Rachid
    4TH INTERNATIONAL CONFERENCE ON SCIENCE & ENGINEERING IN MATHEMATICS, CHEMISTRY AND PHYSICS 2016 (SCIETECH 2016), 2016, 710
  • [10] Cross-validation is dead. Long live cross-validation! Model validation based on resampling
    Knut Baumann
    Journal of Cheminformatics, 2 (Suppl 1)