TONE RECOGNITION OF CONTINUOUS MANDARINE SPEECH-BASED ON NEURAL NETWORKS

被引:0
|
作者
CHEN, SH
WANG, YR
机构
[1] Natl Chiao Tung Univ, Taiwan
来源
关键词
Number:; -; Acronym:; NSC; Sponsor: National Science Council; MOTC; Sponsor: Ministry of Transportation and Communications;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Several neural network-based tone recognition schemes for continuous Mandarin speech are discussed. A basic MLP tone recognizer using recognition features extracted from the processing syllable is first introduced. Then, some additional features extracted from neighboring syllables are added to compensate for the coarticulation effect. It is then further improved to compensate for the effect of sandhi rules of tone pronunciation by including tone information of neighboring syllables. The recognition criterion is now changed to find the best tone sequence that minimizes the total risk that simultaneously considers tone recognition of all syllables in the input utterance. Last, two approaches using HCNN and HSMLP, respectively, to model the intonation pattern as a hidden Markov chain for assisting tone recognition are proposed. The effectiveness of these schemes was confirmed by simulations on a speaker-independent tone recognition task. A recognition rate of 86.72% was achieved.
引用
收藏
页码:146 / 150
页数:5
相关论文
共 50 条
  • [21] Automatic Speech Recognition Based on Neural Networks
    Schlueter, Ralf
    Doetsch, Patrick
    Golik, Pavel
    Kitza, Markus
    Menne, Tobias
    Irie, Kazuki
    Tueske, Zoltan
    Zeyer, Albert
    SPEECH AND COMPUTER, 2016, 9811 : 3 - 17
  • [22] SPEECH-BASED STRESS CLASSIFICATION BASED ON MODULATION SPECTRAL FEATURES AND CONVOLUTIONAL NEURAL NETWORKS
    Avila, Anderson R.
    Kshirsagar, Shruti R.
    Tiwari, Abhishek
    Lafond, Daniel
    O'Shaughnessy, Douglas
    Falk, Tiago H.
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [23] Effect of Reverberation in Speech-based Emotion Recognition
    Zhao, Shujie
    Yang, Yan
    Chen, Jingdong
    2018 IEEE INTERNATIONAL CONFERENCE ON THE SCIENCE OF ELECTRICAL ENGINEERING IN ISRAEL (ICSEE), 2018,
  • [24] An investigation of speech-based human emotion recognition
    Wang, YJ
    Guan, L
    2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2004, : 15 - 18
  • [25] Towards Robust Speech-Based Emotion Recognition
    Tabatabaei, Talieh S.
    Krishnan, Sridhar
    2010 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [26] Tone recognition of continuous Cantonese speech based on support vector machines
    Peng, G
    Wang, WSY
    SPEECH COMMUNICATION, 2005, 45 (01) : 49 - 62
  • [27] Novel Speech-Based Emotion Climate Recognition in Peers' Conversations Incorporating Affect Dynamics and Temporal Convolutional Neural Networks
    Alhussein, Ghada
    Alkhodari, Mohanad
    Khandoker, Ahsan H.
    Hadjileontiadis, Leontios J.
    IEEE ACCESS, 2025, 13 : 16752 - 16769
  • [28] Continuous Speech Recognition based on Convolutional Neural Network
    Zhang, Qing-qing
    Liu, Yong
    Pan, Jie-lin
    Yan, Yong-hong
    SEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2015), 2015, 9631
  • [29] TONE RECOGNITION FOR CONTINUOUS MANDARINE SPEECH WITH LIMITED TRAINING DATA USING SELECTED CONTEXT-DEPENDENT HIDDEN MARKOV-MODELS
    WANG, HM
    LEE, LS
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 1994, 17 (06) : 775 - 784
  • [30] Toward growing modular deep neural networks for continuous speech recognition
    Zohreh Ansari
    Seyyed Ali Seyyedsalehi
    Neural Computing and Applications, 2017, 28 : 1177 - 1196