Factor Analyzed HMM Topology for Speech Recognition

Citations: 0
Authors
Ting, Chuan-Wei [1]
Chien, Jen-Tzung [1]
Affiliations
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
Keywords
factor analysis; similarity measure; HMM topology; speech recognition
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a new factor analyzed (FA) similarity measure between two Gaussian mixture models (GMMs). An adaptive hidden Markov model (HMM) topology is built to compensate for pronunciation variations in speech recognition. The idea is to evaluate whether the variation of an HMM state observed in new speech data is significant, and hence to judge whether a new state should be generated in the models. Owing to the effectiveness of FA for data analysis, we measure GMM similarity by estimating the common factors and specific factors embedded in the HMM means and variances. Similar Gaussian densities are represented by the common factors, while the specific factors express the residual of the similarity measure. We perform a composite hypothesis test based on both the common factors and the specific factors. An adaptive HMM topology is accordingly established from a continuous collection of training utterances. Experiments show that the proposed FA measure outperforms other measures with a comparable number of parameters.
Pages: 1407-1410
Page count: 4