Likelihood normalization using an ergodic HMM for continuous speech recognition

被引:0
|
作者
Ozeki, K
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In recent speech recognition technology, the score of a hypothesis is often defined on the basis of HMM likelihood. As is well known, however, direct use of the likelihood as a scoring function causes difficult problems especially when the length of a speech segment varies depending on the hypothesis as in word-spotting, and some kind of normalization is indispensable. In this paper, a new method of likelihood normalization using an ergodic HMM is presented, and its performance is compared with those of conventional ones. The comparison is made fr om three points of view: recognition rate, word-end detection power, and the mean hypothesis length. It is concluded that the proposed method. gives the best overall performance.
引用
收藏
页码:2301 / 2304
页数:4
相关论文
共 50 条
  • [21] Using SIMD technology to speed up likelihood computation in HMM-based speech recognition systems
    Ou, Jianlin
    Cai, Jun
    Lin, Qian
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 123 - 127
  • [22] Penalized Logistic Regression With HMM Log-Likelihood Regressors for Speech Recognition
    Birkenes, Oystein
    Matsui, Tomoko
    Tanabe, Kunio
    Siniscalchi, Sabato Marco
    Myrvoll, Tor Andre
    Johnsen, Magne Hallstein
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1440 - 1454
  • [23] A MAXIMUM-LIKELIHOOD APPROACH TO CONTINUOUS SPEECH RECOGNITION
    BAHL, LR
    JELINEK, F
    MERCER, RL
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1983, 5 (02) : 179 - 190
  • [24] Irrelevant variability normalization based HMM training using map estimation of feature transforms for robust speech recognition
    Zhu, Donglai
    Huo, Qiang
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4717 - +
  • [25] Emotion Recognition using Continuous Density HMM
    Anila, R.
    Revathy, A.
    2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2015, : 919 - 923
  • [26] Face recognition using Pseudo-2D Ergodic HMM
    Kumar, S. A. Santosh
    Deepti, D. R.
    Prabhakar, B.
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1617 - 1620
  • [27] A GMM/HMM model for reconstruction of missing speech spectral components for continuous speech recognition
    Goodarzi M.M.
    Almasganj F.
    International Journal of Speech Technology, 2016, 19 (4) : 769 - 777
  • [28] A comparison between HMM and hybrid ANN-HMM based systems for continuous speech recognition
    Ynoguti, CA
    Morais, ED
    Violaro, F
    ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 135 - 140
  • [29] A One-Step Tone Recognition Approach Using MSD-HMM for Continuous Speech
    Liu, Changliang
    Ge, Fengpei
    Pan, Fuping
    Dong, Bin
    Yan, Yonghong
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2975 - 2978
  • [30] Implementation of Embedded Unspecific Continuous English Speech Recognition Based on HMM
    Lu, Xiaoli
    Shah, Mohd Asif
    RECENT ADVANCES IN ELECTRICAL & ELECTRONIC ENGINEERING, 2021, 14 (06) : 649 - 659