SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION USING FUZZY PARTITION MODEL (FPM) AND LR PARSERS

被引:0
|
作者
FUKAZAWA, K [1 ]
KATO, Y [1 ]
SUGIYAMA, M [1 ]
机构
[1] ATR INTERPRETING TELEPHONY RES LABS,KYOTO 619,JAPAN
关键词
SPEECH RECOGNITION; SPEAKER INDEPENDENCE; FPM; FPM-LR; NEURAL NETWORK;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper discusses speaker-independent continuous speech recognition using the neural network with a fuzzy partition model (FPM) for phoneme discrimination. Higher-speed learning can be realized with FPM than with the time-delay neural network (TDNN). Using this high-speed property allows speaker-independent phoneme discrimination learning to be realized (heretofore, long learning time was a serious drawback). This paper presents speaker-independent continuous speech recognition using the FPM-LR speech recognition system, wherein FPM is used for the phoneme discrimination and is to be combined with the LR parser. In this experiment, phoneme discrimination training is executed using as speech samples 8 males and 8 females; the recognition performance is evaluated using 278 phrases. The following observations are derived as a result of experiment. A FPM requires less training time than TDNN. The performance can be improved by using the Multi-FPM-LR system, where processes for male, female and mixed case are combined. It is useful to add the power and the delta-spectrum to the acoustic feature parameters. It is effective to use speeches of various utterances (word and phrase) in the training. An 80.0 percent recognition rate is achieved in the recognition of 278 phrases. Finally, the result for the sentence speech recognition is presented.
引用
收藏
页码:32 / 48
页数:17
相关论文
共 50 条
  • [1] A speaker-independent continuous speech recognition system using biomimetic pattern recognition
    Wang Shoujue
    Qin Hong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (03) : 460 - 462
  • [2] SPEAKER-CONSISTENT PARSING FOR SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION
    YAMAGUCHI, K
    SINGER, H
    MATSUNAGA, S
    SAGAYAMA, S
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (06) : 719 - 724
  • [3] ON LARGE-VOCABULARY SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION
    LEE, KF
    [J]. SPEECH COMMUNICATION, 1988, 7 (04) : 375 - 379
  • [4] SPEAKER-INDEPENDENT CONTINUOUS SPEECH DICTATION
    GAUVAIN, JL
    LAMEL, LF
    ADDA, G
    ADDADECKER, M
    [J]. SPEECH COMMUNICATION, 1994, 15 (1-2) : 21 - 37
  • [5] The study on continuous speech of speaker-independent
    Ye Hong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (4A) : 921 - 924
  • [6] Speaker-Independent Speech Recognition using Visual Features
    Pooventhiran, G.
    Sandeep, A.
    Manthiravalli, K.
    Harish, D.
    Renuka, Karthika D.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (11) : 616 - 620
  • [7] Speaker-Independent Speech Recognition using Visual Features
    Pooventhiran G.
    Sandeep A.
    Manthiravalli K.
    Harish D.
    Karthika R.D.
    [J]. International Journal of Advanced Computer Science and Applications, 2020, 11 (11): : 616 - 620
  • [8] Biomimetic pattern recognition for speaker-independent speech recognition
    Qin, H
    Wang, SJ
    Sun, H
    [J]. PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 1290 - 1294
  • [9] Predictor codebook for speaker-independent speech recognition
    Kawabata, Takeshi
    [J]. Systems and Computers in Japan, 1994, 25 (01): : 37 - 46
  • [10] SPEAKER-INDEPENDENT VOWEL RECOGNITION IN PERSIAN SPEECH
    Nazari, Mohammad
    Sayadiyan, Abolghasem
    Valiollahzadeh, Seyyed Majid
    [J]. 2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 672 - 676