Acoustic Factor Analysis for Streamed Hidden Markov Modeling

被引:1
|
作者
Chien, Jen-Tzung [1 ]
Ting, Chuan-Wei [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
关键词
Factor analysis (FA); Markov chain; streamed hidden Markov model; speech recognition; MAXIMUM-LIKELIHOOD; SPEECH; COMBINATION;
D O I
10.1109/TASL.2009.2014891
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a novel streamed hidden Markov model (HMM) framework for speech recognition. The factor analysis (FA) principle is adopted to explore the common factors from acoustic features. The streaming regularities in building HMMs are governed by the correlation between cepstral features, which is inherent in common factors. Those features corresponding to the same factor are generated by the identical HMM state. Accordingly, the multiple Markov chains are adopted to characterize the variation trends in different dimensions of cepstral vectors. An FA streamed HMM (FASHMM) method is developed to relax the assumption of standard HMM topology, namely, that all features of a speech frame perform the same state emission. The proposed FASHMMis more flexible than the streamed factorial HMM (SFHMM) where the streaming was empirically determined. To reduce the number of factor loading matrices in FA, we evaluated the similarity between individual matrices to find the optimal solution to parameter clustering of FA models. A new decoding algorithm was presented to perform FASHMM speech recognition. FASHMM carries out the streamed Markov chains for a sequence of multivariate Gaussian mixture observations through the state transitions of the partitioned vectors. In the experiments, the proposed method reduced the recognition error rates significantly when compared with the standard HMM and SFHMM methods.
引用
收藏
页码:1279 / 1291
页数:13
相关论文
共 50 条
  • [1] Factor analysis of acoustic features for streamed hidden Markov modeling
    Ting, Chuan-Wei
    Chien, Jen-Tzung
    [J]. 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 30 - 35
  • [2] Using Hidden Markov Modeling for Biogeographical Ancestry Analysis
    Currie, Melvin R.
    [J]. JOURNAL OF HUMANISTIC MATHEMATICS, 2019, 9 (02):
  • [3] Hidden Markov Acoustic Modeling With Bootstrap and Restructuring for Low-Resourced Languages
    Cui, Xiaodong
    Xue, Jian
    Chen, Xin
    Olsen, Peder A.
    Dognin, Pierre L.
    Chaudhari, Upendra V.
    Hershey, John R.
    Zhou, Bowen
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (08): : 2252 - 2264
  • [4] Hidden Markov Modeling with HMMTeacher
    Fuentes-Beals, Camilo
    Valdes-Jimenez, Alejandro
    Riadi, Gonzalo
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (02)
  • [5] On Modeling JANUS Packet Errors over a Shallow Water Acoustic Channel using Markov and Hidden Markov Models
    Tomasi, Beatrice
    Casari, Paolo
    Finesso, Lorenzo
    Zappa, Giovanni
    McCoy, Kim
    Zorzi, Michele
    [J]. MILITARY COMMUNICATIONS CONFERENCE, 2010 (MILCOM 2010), 2010, : 2406 - 2411
  • [6] A profile hidden markov model framework for modeling and analysis ofshape
    Huang, Rui
    Pavlovic, Vladimir
    Metaxas, Dimitris N.
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 2121 - +
  • [7] MODELING ACOUSTIC TRANSITIONS IN SPEECH BY STATE-INTERPOLATION HIDDEN MARKOV-MODELS
    LI, D
    KENNY, P
    LENNIG, M
    MERMELSTEIN, P
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (02) : 265 - 271
  • [8] Hidden-Markov Factor Analysis as a Spatiotemporal Model for Electrocorticography
    Omigbodun, Akinyinka
    Doyle, Werner K.
    Devinsky, Orrin
    Friedman, Daniel
    Thesen, Thomas
    Gilja, Vikash
    [J]. 2016 38TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2016, : 1632 - 1635
  • [9] Modeling acoustic correlations by factor analysis
    Saul, L
    Rahim, M
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 749 - 755
  • [10] HIDDEN MARKOV MODELING OVER GRAPHS
    Kayaalp, Mert
    Bordignon, Virginia
    Vlaski, Stefan
    Sayed, Ali H.
    [J]. 2022 IEEE DATA SCIENCE AND LEARNING WORKSHOP (DSLW), 2022,