Factor Analyzed HMM Topology for Speech Recognition

被引:0
|
作者
Ting, Chuan-Wei [1 ]
Chien, Jen-Tzung [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
关键词
factor analysis; similarity measure; HMM topology; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new factor analyzed (FA) similarity measure between two Gaussian mixture models (GMMs). An adaptive hidden Markov model (HMM) topology is built to compensate the pronunciation variations in speech recognition. Our idea aims to evaluate whether the variation of a HMM state from new speech data is significant or not and judge if a new state should be generated in the models. Due to the effectiveness of FA data analysis, we measure the GMM similarity by estimating the common factors and specific factors embedded in the HMM means and variances. Similar Gaussian densities are represented by the common factors. Specific factors express the residual of similarity measure. We perform a composite hypothesis test due to common factors as well as specific factors. An adaptive HMM topology is accordingly established from continuous collection of training utterances. Experiments show that the proposed FA measure outperforms other measures with comparable size of parameters.
引用
收藏
页码:1407 / 1410
页数:4
相关论文
共 50 条
  • [1] Generative factor analyzed HMM for automatic speech recognition
    Yao, KS
    Paliwal, KK
    Lee, TW
    SPEECH COMMUNICATION, 2005, 45 (04) : 435 - 454
  • [2] Adaptive HMM Topology for Speech Recognition
    Ting, Chuan-Wei
    Lee, Kuo-Yuan
    Chien, Jen-Tzung
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1237 - 1240
  • [3] HMM Topology in Continuous Speech Recognition Systems
    Yared, Glauco F. G.
    Violaro, Fabio
    Selmini, Antonio Marcos
    PROCEEDINGS OF THE IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 2006, : 651 - 656
  • [4] FACTOR ANALYZED VOICE MODELS FOR HMM-BASED SPEECH SYNTHESIS
    Kazumi, Kyosuke
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4234 - 4237
  • [5] Simultaneous Optimization of Multiple Tree-Based Factor Analyzed HMM for Speech Synthesis
    Yoshimura, Takenori
    Hashimoto, Kei
    Oura, Keiichiro
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (09) : 1532 - 1541
  • [6] Hmm topology optimization for handwriting recognition
    Li, DF
    Biem, A
    Subrahmonia, J
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 1521 - 1524
  • [7] A new HMM topology for shape recognition
    Arica, N
    Yarman-Vural, FT
    PROCEEDINGS OF THE IEEE-EURASIP WORKSHOP ON NONLINEAR SIGNAL AND IMAGE PROCESSING (NSIP'99), 1999, : 756 - 760
  • [8] Simultaneous Optimization of Multiple Tree Structures for Factor Analyzed HMM-Based Speech Synthesis
    Yoshimura, Takenori
    Hashimoto, Kei
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1196 - 1200
  • [9] Discriminant initialization for factor analyzed HMM training
    Lefevre, Fabrice
    Gauvain, Jean-Luc
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 285 - 288
  • [10] An improved HMM speech recognition model
    Yuan, Lichi
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1311 - 1315