TONE RECOGNITION OF CONTINUOUS MANDARINE SPEECH-BASED ON NEURAL NETWORKS

被引：0

作者：

CHEN, SH

WANG, YR

机构：

[1] Natl Chiao Tung Univ, Taiwan

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1995年 / 3卷 / 02期

关键词：

Number:; -; Acronym:; NSC; Sponsor: National Science Council; MOTC; Sponsor: Ministry of Transportation and Communications;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Several neural network-based tone recognition schemes for continuous Mandarin speech are discussed. A basic MLP tone recognizer using recognition features extracted from the processing syllable is first introduced. Then, some additional features extracted from neighboring syllables are added to compensate for the coarticulation effect. It is then further improved to compensate for the effect of sandhi rules of tone pronunciation by including tone information of neighboring syllables. The recognition criterion is now changed to find the best tone sequence that minimizes the total risk that simultaneously considers tone recognition of all syllables in the input utterance. Last, two approaches using HCNN and HSMLP, respectively, to model the intonation pattern as a hidden Markov chain for assisting tone recognition are proposed. The effectiveness of these schemes was confirmed by simulations on a speaker-independent tone recognition task. A recognition rate of 86.72% was achieved.

引用

页码：146 / 150

页数：5

共 50 条

[41] Stacked Recurrent Neural Networks for Speech-Based Inference of Attachment Condition in School Age Children
Alsofyani, Huda
Vinciarelli, Alessandro
INTERSPEECH 2021, 2021, : 2491 - 2495
[42] A CLUSTER-BASED MULTIPLE DEEP NEURAL NETWORKS METHOD FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
Zhou, Pan
Liu, Cong
Liu, Qingfeng
Dai, Lirong
Jiang, Hui
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6650 - 6654
[43] Convolutional Neural Networks to Facilitate the Continuous Recognition of Arabic Speech with Independent Speakers
Sayed, Sally A.
Seoud, Rania Ahmed Abdel Azeem Abul
Naby, Howida Y. Abdel
JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2024, 2024
[44] BAYESIAN AND GAUSSIAN PROCESS NEURAL NETWORKS FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
Hu, Shoukang
Lam, Max W. Y.
Xie, Xurong
Liu, Shansong
Yu, Jianwei
Wu, Xixin
Liu, Xunying
Meng, Helen
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6555 - 6559
[45] Continuous speech recognition with neural networks and stationary-transitional acoustic units
Gemello, R.
Albesano, D.
Mana, F.
CSELT Technical Reports, 1997, 25 (03): : 389 - 398
[46] Utilizing Psychoacoustic Modeling to Improve Speech-Based Emotion Recognition
Siegert, Ingo
Lotz, Alicia Flores
Egorow, Olga
Wolff, Susann
SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 625 - 635
[47] Speech-Based Automatic Recognition Technology for Major Depression Disorder
Yang, Zhixin
Li, Hualiang
Li, Li
Zhang, Kai
Xiong, Chaolin
Liu, Yuzhong
HUMAN CENTERED COMPUTING, 2019, 11956 : 546 - 553
[48] Tone recognition of continuous Mandarin speech assisted with prosodic information
1600, American Inst of Physics, Woodbury, NY, USA (96):
[49] Tone recognition in continuous Cantonese speech using supratone models
Qian, Yao
Lee, Tan
Soong, Frank K.
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (05): : 2936 - 2945
[50] Continuous speech segmentation algorithms based on artificial neural networks
Gavat, IH
Dumitru, COO
6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XIV, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING III, 2002, : 111 - 114

← 1 2 3 4 5 →