Likelihood normalization using an ergodic HMM for continuous speech recognition

被引:0
|
作者
Ozeki, K
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In recent speech recognition technology, the score of a hypothesis is often defined on the basis of HMM likelihood. As is well known, however, direct use of the likelihood as a scoring function causes difficult problems especially when the length of a speech segment varies depending on the hypothesis as in word-spotting, and some kind of normalization is indispensable. In this paper, a new method of likelihood normalization using an ergodic HMM is presented, and its performance is compared with those of conventional ones. The comparison is made fr om three points of view: recognition rate, word-end detection power, and the mean hypothesis length. It is concluded that the proposed method. gives the best overall performance.
引用
收藏
页码:2301 / 2304
页数:4
相关论文
共 50 条
  • [41] Scalable HMM based Inference Engine in Large Vocabulary Continuous Speech Recognition
    Chong, Jike
    You, Kisun
    Yi, Youngmin
    Gonina, Ekaterina
    Hughes, Christopher
    Sung, Wonyong
    Keutzer, Kurt
    [J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 1793 - +
  • [42] A NN/HMM hybrid for continuous speech recognition with a discriminant nonlinear feature extraction
    Rigoll, G
    Willett, D
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 9 - 12
  • [43] Combining TDNN and HMM in a Hybrid System for Improved Continuous-Speech Recognition
    Dugast, Christian
    Devillers, Laurence
    Aubert, Xavier
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 217 - 223
  • [44] The hybrid ANN/HMM method with double MLP structure for continuous speech recognition
    Lee, TZ
    Chen, DW
    [J]. PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998, : 1096 - 1098
  • [45] HMM Based Continuous EOG Recognition for Eye-input Speech Interface
    Fang, Fuming
    Shinozaki, Takahiro
    Horiuchi, Yasuo
    Kuroiwa, Shingo
    Furui, Sadaoki
    Musha, Toshimitsu
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 734 - 737
  • [46] Hybrid continuous speech recognition systems by HMM, MLP and SVM: a comparative study
    Zarrouk, Elyes
    Ben Ayed, Yassine
    Gargouri, Faiez
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (03) : 223 - 233
  • [47] New feedback method of hybrid HMM/ANN methods for continuous speech recognition
    Lee, TZ
    Chen, DW
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 509 - 512
  • [48] Tree-Based HMM State Tying for Arabic Continuous Speech Recognition
    Azim, Mona A.
    Hamid, A. Aziz A.
    Badr, Nagwa L.
    Tolba, M. F.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 96 - 103
  • [49] Compensation of speaker directivity in speech recognition using HMM composition
    Giron, F
    Minami, Y
    Tanaka, M
    Furuya, K
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 253 - 256
  • [50] Speech Recognition Using Weighted HMM and Subspace Projection Approaches
    Su, Keh-Yih
    Lee, Chin-Hui
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 69 - 79