TONE RECOGNITION OF CONTINUOUS MANDARINE SPEECH-BASED ON NEURAL NETWORKS

被引:0
|
作者
CHEN, SH
WANG, YR
机构
[1] Natl Chiao Tung Univ, Taiwan
来源
关键词
Number:; -; Acronym:; NSC; Sponsor: National Science Council; MOTC; Sponsor: Ministry of Transportation and Communications;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Several neural network-based tone recognition schemes for continuous Mandarin speech are discussed. A basic MLP tone recognizer using recognition features extracted from the processing syllable is first introduced. Then, some additional features extracted from neighboring syllables are added to compensate for the coarticulation effect. It is then further improved to compensate for the effect of sandhi rules of tone pronunciation by including tone information of neighboring syllables. The recognition criterion is now changed to find the best tone sequence that minimizes the total risk that simultaneously considers tone recognition of all syllables in the input utterance. Last, two approaches using HCNN and HSMLP, respectively, to model the intonation pattern as a hidden Markov chain for assisting tone recognition are proposed. The effectiveness of these schemes was confirmed by simulations on a speaker-independent tone recognition task. A recognition rate of 86.72% was achieved.
引用
收藏
页码:146 / 150
页数:5
相关论文
共 50 条
  • [1] TONE RECOGNITION OF CONTINUOUS MANDARINE SPEECH ASSISTED WITH PROSODIC INFORMATION
    WANG, YR
    CHEN, SH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (05): : 2637 - 2645
  • [2] Tone recognition of continuous Mandarin speech based on tone nucleus model and neural network
    Wang, Xiao-Dong
    Hirose, Keikichi
    Zhang, Jin-Song
    Minematsu, Nobuaki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (06) : 1748 - 1755
  • [3] Continuous speech recognition by convolutional neural networks
    Zhang, Qing-Qing
    Liu, Yong
    Pan, Jie-Lin
    Yan, Yong-Hong
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2015, 37 (09): : 1212 - 1217
  • [4] NEURAL NETWORKS FOR STATISTICAL RECOGNITION OF CONTINUOUS SPEECH
    MORGAN, N
    BOURLARD, HA
    PROCEEDINGS OF THE IEEE, 1995, 83 (05) : 742 - 770
  • [5] Multicriteria Neural Network Design in the Speech-based Emotion Recognition Problem
    Brester, Christina
    Semenkin, Eugene
    Sidorov, Maxim
    Semenkina, Olga
    ICIMCO 2015 PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL. 1, 2015, : 621 - 628
  • [7] Continuous Speech Emotion Recognition with Convolutional Neural Networks
    Vryzas, Nikolaos
    Vrysis, Lazaros
    Matsiola, Maria
    Kotsakis, Rigas
    Dimoulas, Charalampos
    Kalliris, George
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2020, 68 (1-2): : 14 - 24
  • [8] Continuous speech emotion recognition with convolutional neural networks
    Vryzas, Nikolaos
    Vrysis, Lazaros
    Matsiola, Maria
    Kotsakis, Rigas
    Dimoulas, Charalampos
    Kalliris, George
    AES: Journal of the Audio Engineering Society, 2020, 68 (1-2): : 14 - 24
  • [9] Robust Speech-Based Happiness Recognition
    Lin, Chang-Hong
    Siahaan, Ernestasia
    Chin, Yu-Hau
    Chen, Bo-Wei
    Wang, Jia-Ching
    Wang, Jhing-Fa
    1ST INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT 2013), 2013, : 227 - 230
  • [10] A review of speech-based bimodal recognition
    Chibelushi, CC
    Deravi, F
    Mason, JSD
    IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (01) : 23 - 37