TONE RECOGNITION OF CONTINUOUS MANDARINE SPEECH-BASED ON NEURAL NETWORKS

被引:0
|
作者
CHEN, SH
WANG, YR
机构
[1] Natl Chiao Tung Univ, Taiwan
来源
关键词
Number:; -; Acronym:; NSC; Sponsor: National Science Council; MOTC; Sponsor: Ministry of Transportation and Communications;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Several neural network-based tone recognition schemes for continuous Mandarin speech are discussed. A basic MLP tone recognizer using recognition features extracted from the processing syllable is first introduced. Then, some additional features extracted from neighboring syllables are added to compensate for the coarticulation effect. It is then further improved to compensate for the effect of sandhi rules of tone pronunciation by including tone information of neighboring syllables. The recognition criterion is now changed to find the best tone sequence that minimizes the total risk that simultaneously considers tone recognition of all syllables in the input utterance. Last, two approaches using HCNN and HSMLP, respectively, to model the intonation pattern as a hidden Markov chain for assisting tone recognition are proposed. The effectiveness of these schemes was confirmed by simulations on a speaker-independent tone recognition task. A recognition rate of 86.72% was achieved.
引用
收藏
页码:146 / 150
页数:5
相关论文
共 50 条
  • [31] Continuous mandarin speech recognition using hierarchical recurrent neural networks
    Liao, YF
    Chen, WY
    Chen, SH
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 3370 - 3373
  • [32] Toward growing modular deep neural networks for continuous speech recognition
    Ansari, Zohreh
    Seyyedsalehi, Seyyed Ali
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 : S1177 - S1196
  • [33] Continuous speech recognition with neural networks and stationary-transitional acoustic
    Gemello, R
    Albesano, D
    Mana, F
    1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 2107 - 2111
  • [34] Speech-based Emotion Recognition and Next Reaction Prediction
    Noroozi, Fatemeh
    Akrami, Neda
    Anbarjafari, Gholamreza
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [35] Speech Recognition Based on Weight Function Neural Networks
    Zhang, Daiyuan
    Zhao, Ran
    APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 1565 - 1568
  • [36] Mongolian Speech Recognition Based on Deep Neural Networks
    Zhang, Hui
    Bao, Feilong
    Gao, Guanglai
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA (CCL 2015), 2015, 9427 : 180 - 188
  • [37] A HIERARCHICAL NEURAL NETWORK MODEL BASED ON A C/V SEGMENTATION ALGORITHM FOR ISOLATED MANDARINE SPEECH RECOGNITION
    WANG, JF
    WU, CH
    CHANG, SH
    LEE, JY
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (09) : 2141 - 2146
  • [38] Deep Neural Networks for Mandarin Tone Recognition
    Chen, Mingming
    Yang, Zhanlei
    Liu, Wenju
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 1154 - 1158
  • [39] ECHO: A speech recognition package for the design of robust interactive speech-based applications
    Kabré H.
    International Journal of Speech Technology, 1997, 2 (2) : 133 - 143
  • [40] Compensate the Speech Recognition Delays for Accurate Speech-Based Cursor Position Control
    Tong, Qiang
    Wang, Ziyun
    HUMAN-COMPUTER INTERACTION, PT II, 2009, 5611 : 752 - 760