Likelihood normalization using an ergodic HMM for continuous speech recognition

被引：0

作者：

Ozeki, K

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In recent speech recognition technology, the score of a hypothesis is often defined on the basis of HMM likelihood. As is well known, however, direct use of the likelihood as a scoring function causes difficult problems especially when the length of a speech segment varies depending on the hypothesis as in word-spotting, and some kind of normalization is indispensable. In this paper, a new method of likelihood normalization using an ergodic HMM is presented, and its performance is compared with those of conventional ones. The comparison is made fr om three points of view: recognition rate, word-end detection power, and the mean hypothesis length. It is concluded that the proposed method. gives the best overall performance.

引用

页码：2301 / 2304

页数：4

共 50 条

[21] Using SIMD technology to speed up likelihood computation in HMM-based speech recognition systems
Ou, Jianlin
Cai, Jun
Lin, Qian
2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 123 - 127
[22] Penalized Logistic Regression With HMM Log-Likelihood Regressors for Speech Recognition
Birkenes, Oystein
Matsui, Tomoko
Tanabe, Kunio
Siniscalchi, Sabato Marco
Myrvoll, Tor Andre
Johnsen, Magne Hallstein
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1440 - 1454
[23] A MAXIMUM-LIKELIHOOD APPROACH TO CONTINUOUS SPEECH RECOGNITION
BAHL, LR
JELINEK, F
MERCER, RL
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1983, 5 (02) : 179 - 190
[24] Irrelevant variability normalization based HMM training using map estimation of feature transforms for robust speech recognition
Zhu, Donglai
Huo, Qiang
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4717 - +
[25] Emotion Recognition using Continuous Density HMM
Anila, R.
Revathy, A.
2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2015, : 919 - 923
[26] Face recognition using Pseudo-2D Ergodic HMM
Kumar, S. A. Santosh
Deepti, D. R.
Prabhakar, B.
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1617 - 1620
[27] A GMM/HMM model for reconstruction of missing speech spectral components for continuous speech recognition
Goodarzi M.M.
Almasganj F.
International Journal of Speech Technology, 2016, 19 (4) : 769 - 777
[28] A comparison between HMM and hybrid ANN-HMM based systems for continuous speech recognition
Ynoguti, CA
Morais, ED
Violaro, F
ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 135 - 140
[29] A One-Step Tone Recognition Approach Using MSD-HMM for Continuous Speech
Liu, Changliang
Ge, Fengpei
Pan, Fuping
Dong, Bin
Yan, Yonghong
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2975 - 2978
[30] Implementation of Embedded Unspecific Continuous English Speech Recognition Based on HMM
Lu, Xiaoli
Shah, Mohd Asif
RECENT ADVANCES IN ELECTRICAL & ELECTRONIC ENGINEERING, 2021, 14 (06) : 649 - 659

← 1 2 3 4 5 →