SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION USING FUZZY PARTITION MODEL (FPM) AND LR PARSERS

被引:0
|
作者
FUKAZAWA, K [1 ]
KATO, Y [1 ]
SUGIYAMA, M [1 ]
机构
[1] ATR INTERPRETING TELEPHONY RES LABS,KYOTO 619,JAPAN
关键词
SPEECH RECOGNITION; SPEAKER INDEPENDENCE; FPM; FPM-LR; NEURAL NETWORK;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper discusses speaker-independent continuous speech recognition using the neural network with a fuzzy partition model (FPM) for phoneme discrimination. Higher-speed learning can be realized with FPM than with the time-delay neural network (TDNN). Using this high-speed property allows speaker-independent phoneme discrimination learning to be realized (heretofore, long learning time was a serious drawback). This paper presents speaker-independent continuous speech recognition using the FPM-LR speech recognition system, wherein FPM is used for the phoneme discrimination and is to be combined with the LR parser. In this experiment, phoneme discrimination training is executed using as speech samples 8 males and 8 females; the recognition performance is evaluated using 278 phrases. The following observations are derived as a result of experiment. A FPM requires less training time than TDNN. The performance can be improved by using the Multi-FPM-LR system, where processes for male, female and mixed case are combined. It is useful to add the power and the delta-spectrum to the acoustic feature parameters. It is effective to use speeches of various utterances (word and phrase) in the training. An 80.0 percent recognition rate is achieved in the recognition of 278 phrases. Finally, the result for the sentence speech recognition is presented.
引用
收藏
页码:32 / 48
页数:17
相关论文
共 50 条
  • [41] HMM-based integrated method for speaker-independent speech recognition
    Tsinghua Univ, Beijing, China
    [J]. Int Conf Signal Process Proc, (613-616):
  • [42] REFERENCE TEMPLATE ADAPTATION IN SPEAKER-INDEPENDENT ISOLATED WORD SPEECH RECOGNITION
    MCINNES, FR
    JACK, MA
    [J]. ELECTRONICS LETTERS, 1987, 23 (24) : 1304 - 1305
  • [43] SPEAKER-INDEPENDENT SPEECH RECOGNITION UNIT DEVELOPMENT FOR TELEPHONE LINE USE
    ISHII, N
    IMAI, Y
    NAKATSU, R
    ANDO, M
    [J]. JAPAN TELECOMMUNICATIONS REVIEW, 1982, 24 (03): : 267 - 274
  • [44] NORMALIZING THE VOCAL-TRACT LENGTH FOR SPEAKER-INDEPENDENT SPEECH RECOGNITION
    LIN, QG
    CHE, CW
    [J]. IEEE SIGNAL PROCESSING LETTERS, 1995, 2 (11) : 201 - 203
  • [45] SPEAKER-INDEPENDENT SPEECH-RECOGNITION SYSTEM BASED ON LINEAR PREDICTION
    GUPTA, VN
    BRYAN, JK
    GOWDY, JN
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1978, 26 (01): : 27 - 33
  • [46] DSP-based large vocabulary speaker-independent speech recognition
    Hirayama, H
    Yoshida, K
    Koga, S
    Hattori, H
    [J]. NEC RESEARCH & DEVELOPMENT, 1996, 37 (04): : 528 - 534
  • [47] A HMM-based integrated method for speaker-independent speech recognition
    Zhang, YY
    Zhu, XY
    [J]. ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 613 - 616
  • [48] Continuous speech of speaker-independent based on two weight neural networks
    Cao Wen-ming
    Ye Hong
    Xu Chun-yan
    Wang Shou-jue
    [J]. PROCEEDINGS OF 2005 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1 AND 2, 2005, : 1415 - +
  • [49] Speaker Recognition using Speaker-independent Universal Acoustic Model and Synchronous Sensing for "Business Microscope"
    Nishimura, Jun
    Kuroda, Tadahiro
    [J]. ISWPC: 2009 4TH INTERNATIONAL SYMPOSIUM ON WIRELESS PERVASIVE COMPUTING, 2009, : 304 - 308
  • [50] Domain Invariant Feature Learning for Speaker-Independent Speech Emotion Recognition
    Lu, Cheng
    Zong, Yuan
    Zheng, Wenming
    Li, Yang
    Tang, Chuangao
    Schuller, Bjoern W.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2217 - 2230