Hierarchical large-margin Gaussian mixture models for phonetic classification

被引:14
|
作者
Chang, Hung-An [1 ]
Glass, James R. [1 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
关键词
hierarchical classifier; committee classifier; large margin GMM; phonetic classification;
D O I
10.1109/ASRU.2007.4430123
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a hierarchical large-margin Gaussian mixture modeling framework and evaluate it on the task of phonetic classification. A two-stage hierarchical. classifier is trained by alternately updating parameters at different levels in the tree to maximize the joint margin of the overall classification. Since the loss function required in the training is convex to the parameter space the problem of spurious local minima is avoided. The model achieves good performance with fewer parameters than single-level classifiers. In the TIMIT benchmark task of context-independent phonetic classification, the proposed modeling scheme achieves a state-of-the-art phonetic classification error of 16.7% on the core test set. This is an absolute reduction of 1.6% from the best previously reported result on this task, and 4-5% lower than a variety of classifiers that have been recently examined on this task.
引用
收藏
页码:272 / 277
页数:6
相关论文
共 50 条
  • [1] Loss-Scaled Large-Margin Gaussian Mixture Models for Speech Emotion Classification
    Yun, Sungrack
    Yoo, Chang D.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (02): : 585 - 598
  • [2] Large margin Gaussian mixture modeling for phonetic classification and recognition
    Sha, Fei
    Saul, Lawrence K.
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 265 - 268
  • [3] A Geometric Perspective of Large-Margin Training of Gaussian Models
    Xiao, Lin
    Deng, Li
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2010, 27 (06) : 118 - 123
  • [4] Hierarchical learning of large-margin metrics for large-scale image classification
    Lei, Hao
    Mei, Kuizhi
    Xin, Jingmin
    Dong, Peixiang
    Fan, Jianping
    [J]. NEUROCOMPUTING, 2016, 208 : 46 - 58
  • [5] Large-Margin Classification in Hyperbolic Space
    Cho, Hyunghoon
    DeMeo, Benjamin
    Peng, Jian
    Berger, Bonnie
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [6] VARIABILITY REGULARIZATION IN LARGE-MARGIN CLASSIFICATION
    Mansjur, Dwi Sianto
    Wada, Ted S.
    Juang, Biing-Hwang
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1956 - 1959
  • [7] Large Margin Gaussian Mixture Models with Differential Privacy
    Pathak, Manas A.
    Raj, Bhiksha
    [J]. IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2012, 9 (04) : 463 - 469
  • [8] Large Margin Gaussian mixture models for speaker identification
    Jourani, Reda
    Daoudi, Khalid
    Andre-Obrecht, Regine
    Aboutajdine, Driss
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1441 - +
  • [9] Large-Margin Classification in Infinite Neural Networks
    Cho, Youngmin
    Saul, Lawrence K.
    [J]. NEURAL COMPUTATION, 2010, 22 (10) : 2678 - 2697
  • [10] Large-margin multi-view Gaussian process
    Chang Xu
    Dacheng Tao
    Yangxi Li
    Chao Xu
    [J]. Multimedia Systems, 2015, 21 : 147 - 157