Factor Analyzed HMM Topology for Speech Recognition

被引:0
|
作者
Ting, Chuan-Wei [1 ]
Chien, Jen-Tzung [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
关键词
factor analysis; similarity measure; HMM topology; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new factor analyzed (FA) similarity measure between two Gaussian mixture models (GMMs). An adaptive hidden Markov model (HMM) topology is built to compensate the pronunciation variations in speech recognition. Our idea aims to evaluate whether the variation of a HMM state from new speech data is significant or not and judge if a new state should be generated in the models. Due to the effectiveness of FA data analysis, we measure the GMM similarity by estimating the common factors and specific factors embedded in the HMM means and variances. Similar Gaussian densities are represented by the common factors. Specific factors express the residual of similarity measure. We perform a composite hypothesis test due to common factors as well as specific factors. An adaptive HMM topology is accordingly established from continuous collection of training utterances. Experiments show that the proposed FA measure outperforms other measures with comparable size of parameters.
引用
收藏
页码:1407 / 1410
页数:4
相关论文
共 50 条
  • [31] A HMM speech recognition system based on FPGA
    Ke, Sujuan
    Hou, Yibin
    Huang, Zhangqin
    Li, Hui
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 305 - 309
  • [32] A Mongolian speech recognition system based on HMM
    Gao, Guanglai
    Biligetu
    Nabuqing
    Zhang, Shuwu
    COMPUTATIONAL INTELLIGENCE, PT 2, PROCEEDINGS, 2006, 4114 : 667 - 676
  • [33] An HMM-based speech recognition IC
    Han, W
    Hon, KW
    Chan, CF
    Lee, T
    Choy, CS
    Pun, KP
    Ching, PC
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II: COMMUNICATIONS-MULTIMEDIA SYSTEMS & APPLICATIONS, 2003, : 744 - 747
  • [34] Speech Recognition Using HMM-CNN
    Santos, Lyndaines
    Moreira, Nicolas de Araujo
    Sampaio, Robson
    Lima, Raizielle
    Mattos Brito Oliveira, Francisco Carlos
    INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, WORLDCIST 2023, 2024, 799 : 528 - 537
  • [35] Speech recognition with HMM models for cochlear prostheses
    Sakka, Z
    Kachouri, A
    Samet, M
    2004 IEEE International Conference on Industrial Technology (ICIT), Vols. 1- 3, 2004, : 1478 - 1481
  • [36] HMM/NN hybrids for continuous speech recognition
    Alim, OAA
    Elboghdadly, N
    El Shaar, NM
    PROCEEDINGS OF THE EIGHTEENTH NATIONAL RADIO SCIENCE CONFERENCE, VOLS 1 AND 2, 2001, : 509 - 516
  • [37] Hybrid modeling of PHMM and HMM for speech recognition
    Ogawa, T
    Kobayashi, T
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 140 - 143
  • [38] DELETED SMOOTHING OF HMM PARAMETERS IN SPEECH RECOGNITION
    KIM, NS
    UN, CK
    ELECTRONICS LETTERS, 1993, 29 (09) : 735 - 736
  • [39] Automatic Generation of HMM Topology for Sign Language Recognition
    Matsuo, Tadashi
    Shirai, Yoshiaki
    Shimada, Nobutaka
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 920 - 923
  • [40] A Novel Visual Speech Representation and HMM Classification for Visual Speech Recognition
    Yu, Dahai
    Ghita, Ovidiu
    Sutherland, Alistair
    Whelan, Paul F.
    ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PROCEEDINGS, 2009, 5414 : 398 - 409