TONE RECOGNITION OF CONTINUOUS MANDARINE SPEECH-BASED ON NEURAL NETWORKS

被引:0
|
作者
CHEN, SH
WANG, YR
机构
[1] Natl Chiao Tung Univ, Taiwan
来源
关键词
Number:; -; Acronym:; NSC; Sponsor: National Science Council; MOTC; Sponsor: Ministry of Transportation and Communications;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Several neural network-based tone recognition schemes for continuous Mandarin speech are discussed. A basic MLP tone recognizer using recognition features extracted from the processing syllable is first introduced. Then, some additional features extracted from neighboring syllables are added to compensate for the coarticulation effect. It is then further improved to compensate for the effect of sandhi rules of tone pronunciation by including tone information of neighboring syllables. The recognition criterion is now changed to find the best tone sequence that minimizes the total risk that simultaneously considers tone recognition of all syllables in the input utterance. Last, two approaches using HCNN and HSMLP, respectively, to model the intonation pattern as a hidden Markov chain for assisting tone recognition are proposed. The effectiveness of these schemes was confirmed by simulations on a speaker-independent tone recognition task. A recognition rate of 86.72% was achieved.
引用
收藏
页码:146 / 150
页数:5
相关论文
共 50 条
  • [41] Stacked Recurrent Neural Networks for Speech-Based Inference of Attachment Condition in School Age Children
    Alsofyani, Huda
    Vinciarelli, Alessandro
    INTERSPEECH 2021, 2021, : 2491 - 2495
  • [42] A CLUSTER-BASED MULTIPLE DEEP NEURAL NETWORKS METHOD FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
    Zhou, Pan
    Liu, Cong
    Liu, Qingfeng
    Dai, Lirong
    Jiang, Hui
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6650 - 6654
  • [43] Convolutional Neural Networks to Facilitate the Continuous Recognition of Arabic Speech with Independent Speakers
    Sayed, Sally A.
    Seoud, Rania Ahmed Abdel Azeem Abul
    Naby, Howida Y. Abdel
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2024, 2024
  • [44] BAYESIAN AND GAUSSIAN PROCESS NEURAL NETWORKS FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
    Hu, Shoukang
    Lam, Max W. Y.
    Xie, Xurong
    Liu, Shansong
    Yu, Jianwei
    Wu, Xixin
    Liu, Xunying
    Meng, Helen
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6555 - 6559
  • [45] Continuous speech recognition with neural networks and stationary-transitional acoustic units
    Gemello, R.
    Albesano, D.
    Mana, F.
    CSELT Technical Reports, 1997, 25 (03): : 389 - 398
  • [46] Utilizing Psychoacoustic Modeling to Improve Speech-Based Emotion Recognition
    Siegert, Ingo
    Lotz, Alicia Flores
    Egorow, Olga
    Wolff, Susann
    SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 625 - 635
  • [47] Speech-Based Automatic Recognition Technology for Major Depression Disorder
    Yang, Zhixin
    Li, Hualiang
    Li, Li
    Zhang, Kai
    Xiong, Chaolin
    Liu, Yuzhong
    HUMAN CENTERED COMPUTING, 2019, 11956 : 546 - 553
  • [48] Tone recognition of continuous Mandarin speech assisted with prosodic information
    1600, American Inst of Physics, Woodbury, NY, USA (96):
  • [49] Tone recognition in continuous Cantonese speech using supratone models
    Qian, Yao
    Lee, Tan
    Soong, Frank K.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (05): : 2936 - 2945
  • [50] Continuous speech segmentation algorithms based on artificial neural networks
    Gavat, IH
    Dumitru, COO
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XIV, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING III, 2002, : 111 - 114