Practical speaker-independent voice recognition using segmental features

被引:0
|
作者
Kimura, T [1 ]
Ashida, A [1 ]
Niyada, K [1 ]
机构
[1] Matsushita Commun Ind Co Ltd, Yokohama, Kanagawa 2248539, Japan
关键词
voice recognition; word spotting; acoustic model; dynamic feature; HMM;
D O I
10.1002/ecjb.10217
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper reports a practical method that achieves speaker-independent large-vocabulary voice recognition with high accuracy and high noise immunity but with small computational complexity. The first feature of the method is that highly accurate recognition is realized by using an acoustic model in which the input consists of the segmental features formed by the analysis parameters of multiple frames. The second feature is that the likelihood corresponding to the output probability of each state of the acoustic model is calculated by a linear expression with the input parameter vector as the variable. The linear expression is derived from the equal-covariance assumption. This linear expression reduces the computational complexity and the required memory capacity even if segmental features are used, without degrading recognition performance. The third feature is that a stable word spotting function detects correct solution candidates from the noise interval by including the idea of the a posteriori probability in the likelihood calculation. This word spotting function allows voice recognition which is robust to noise. to be realized. The effectiveness of the proposed system was demonstrated in a recognition experiment with superimposed interior noise of a running vehicle. (C) 2004 Wiley Periodicals, Inc.
引用
收藏
页码:73 / 81
页数:9
相关论文
共 50 条
  • [1] Speaker-Independent Speech Recognition using Visual Features
    Pooventhiran, G.
    Sandeep, A.
    Manthiravalli, K.
    Harish, D.
    Renuka, Karthika D.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (11) : 616 - 620
  • [2] Speaker-Independent Speech Recognition using Visual Features
    Pooventhiran G.
    Sandeep A.
    Manthiravalli K.
    Harish D.
    Karthika R.D.
    [J]. International Journal of Advanced Computer Science and Applications, 2020, 11 (11): : 616 - 620
  • [3] Speaker-independent recognition of isolated voice commands using auditory models
    Kolokolev, AS
    Yakhno, VP
    [J]. AUTOMATION AND REMOTE CONTROL, 1995, 56 (08) : 1176 - 1182
  • [5] SPEAKER-INDEPENDENT ISOLATED WORD RECOGNITION USING DYNAMIC FEATURES OF SPEECH SPECTRUM
    FURUI, S
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (01): : 52 - 59
  • [6] Speaker-independent recognition of Chinese tones
    GUAN Cuntai and CHEN Yongbin(Dep. of Radio Eng.
    [J]. Chinese Journal of Acoustics, 1993, (02) : 142 - 148
  • [7] SPEAKER-INDEPENDENT WORD RECOGNITION TECHNIQUES FOR CONTROL OF A VOICE MESSAGING SYSTEM.
    Gupta, V.
    Mermelstein, P.
    [J]. U.S. Symposium on Rock Mechanics, 1981, : 233 - 238
  • [8] SPEAKER-INDEPENDENT DIGIT RECOGNITION SYSTEM
    SAMBUR, MR
    RABINER, LR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 56 : S26 - S26
  • [9] SPEAKER-INDEPENDENT MANDARINE PLOSIVE RECOGNITION WITH DYNAMIC FEATURES AND MULTILAYER PERCEPTRONS
    CHEN, WY
    CHEN, SH
    [J]. ELECTRONICS LETTERS, 1995, 31 (04) : 258 - 259
  • [10] DYNAMIC SPEAKER ADAPTATION IN SPEAKER-INDEPENDENT WORD RECOGNITION
    HEWETT, AJ
    HOLMES, G
    YOUNG, SJ
    [J]. PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 275 - 282