STATE-DEPENDENT TIME WARPING IN THE TRENDED HIDDEN MARKOV MODEL

Cited by: 9
Authors
SUN, DX [1]
DENG, L [1]
WU, CFJ [1]
Affiliation
[1] SUNY STONY BROOK, STONY BROOK, NY 11794
Keywords
SPEECH SIGNAL; ACOUSTIC TRANSITION; SCALING; HIDDEN MARKOV MODEL; NONSTATIONARITY; TIME WARPING; AUXILIARY PARAMETER; VITERBI ALGORITHM;
DOI
10.1016/0165-1684(94)90089-2
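Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808 ; 0809 ;
Abstract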
In this paper we present an algorithm for estimating state-dependent polynomial coefficients in the nonstationary-state hidden Markov model (the trended HMM) that allows for linear time warping, or scaling, in individual model states. The need for state-dependent time warping arises because, owing to speaking-rate variation and other temporal factors in speech, the multiple state-segmented speech data sequences used to train a single set of polynomial coefficients often vary appreciably in length. The algorithm is developed within a general framework that uses auxiliary parameters, which, although of no interest in themselves, provide an intermediate tool for achieving maximal accuracy in estimating the polynomial coefficients of the trended HMM. It is proved that the proposed estimation algorithm converges to a solution equivalent to the state-optimized maximum likelihood estimate. The effectiveness of the algorithm is demonstrated in experiments designed to fit a single trended HMM simultaneously to multiple sequences of speech data that are different renditions of the same word yet vary widely in sequence length. Speech recognition experiments were performed on the standard acoustic-phonetic TIMIT database. The results demonstrate the advantage of the time-warping trended HMMs over the regular trended HMMs, with an improvement of about 10 to 15% in recognition rate.
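To illustrate why a per-segment time scale matters when one set of polynomial coefficients must fit segments of different lengths, here is a minimal sketch. It is not the estimation algorithm of the paper: the scale of each segment is simply fixed to the reciprocal of its length rather than estimated as an auxiliary parameter, observations are assumed scalar, and the function names `warped_design` and `fit_shared_trend` and the synthetic data are hypothetical.

```python
import numpy as np

def warped_design(n_frames, scale, order):
    """Design matrix with columns (scale * t)**p, t = 0..n_frames-1, p = 0..order."""
    t = scale * np.arange(n_frames)
    return np.vander(t, order + 1, increasing=True)

def fit_shared_trend(segments, scales, order):
    """Least-squares fit of ONE coefficient vector to all segments,
    each segment using its own linear time scale (assumed given here)."""
    X = np.vstack([warped_design(len(s), a, order)
                   for s, a in zip(segments, scales)])
    y = np.concatenate(segments)
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    rss = float(np.sum((y - X @ coef) ** 2))  # residual sum of squares
    return coef, rss

# Two synthetic renditions of the same trend 1 + 2*u, u in [0, 1],
# sampled with 11 and 21 frames (a faster and a slower rendition).
seg_a = 1.0 + 2.0 * np.linspace(0.0, 1.0, 11)
seg_b = 1.0 + 2.0 * np.linspace(0.0, 1.0, 21)
segments = [seg_a, seg_b]

# No warping: both segments share the raw frame index.
coef_raw, rss_raw = fit_shared_trend(segments, [1.0, 1.0], order=1)

# Length-based warping (an assumption of this sketch): map each
# segment's frame index onto [0, 1] before fitting.
scales = [1.0 / (len(s) - 1) for s in segments]
coef_warp, rss_warp = fit_shared_trend(segments, scales, order=1)

print("unwarped RSS:", rss_raw)
print("warped coefficients:", coef_warp, "RSS:", rss_warp)
```

Without warping, no single linear trend fits both segments, so the residual stays large. With the per-segment scale, the pooled fit recovers the common trend. In the paper the scaling is estimated jointly with the coefficients rather than fixed from the segment length.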
Pages: 263-275
Number of pages: 13