TONE RECOGNITION OF CONTINUOUS MANDARINE SPEECH-BASED ON NEURAL NETWORKS

被引：0

作者：

CHEN, SH

WANG, YR

机构：

[1] Natl Chiao Tung Univ, Taiwan

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1995年 / 3卷 / 02期

关键词：

Number:; -; Acronym:; NSC; Sponsor: National Science Council; MOTC; Sponsor: Ministry of Transportation and Communications;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Several neural network-based tone recognition schemes for continuous Mandarin speech are discussed. A basic MLP tone recognizer using recognition features extracted from the processing syllable is first introduced. Then, some additional features extracted from neighboring syllables are added to compensate for the coarticulation effect. It is then further improved to compensate for the effect of sandhi rules of tone pronunciation by including tone information of neighboring syllables. The recognition criterion is now changed to find the best tone sequence that minimizes the total risk that simultaneously considers tone recognition of all syllables in the input utterance. Last, two approaches using HCNN and HSMLP, respectively, to model the intonation pattern as a hidden Markov chain for assisting tone recognition are proposed. The effectiveness of these schemes was confirmed by simulations on a speaker-independent tone recognition task. A recognition rate of 86.72% was achieved.

引用

页码：146 / 150

页数：5

共 50 条

[31] Continuous mandarin speech recognition using hierarchical recurrent neural networks
Liao, YF
Chen, WY
Chen, SH
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 3370 - 3373
[32] Toward growing modular deep neural networks for continuous speech recognition
Ansari, Zohreh
Seyyedsalehi, Seyyed Ali
NEURAL COMPUTING & APPLICATIONS, 2017, 28 : S1177 - S1196
[33] Continuous speech recognition with neural networks and stationary-transitional acoustic
Gemello, R
Albesano, D
Mana, F
1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 2107 - 2111
[34] Speech-based Emotion Recognition and Next Reaction Prediction
Noroozi, Fatemeh
Akrami, Neda
Anbarjafari, Gholamreza
2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
[35] Speech Recognition Based on Weight Function Neural Networks
Zhang, Daiyuan
Zhao, Ran
APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 1565 - 1568
[36] Mongolian Speech Recognition Based on Deep Neural Networks
Zhang, Hui
Bao, Feilong
Gao, Guanglai
CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA (CCL 2015), 2015, 9427 : 180 - 188
[37] A HIERARCHICAL NEURAL NETWORK MODEL BASED ON A C/V SEGMENTATION ALGORITHM FOR ISOLATED MANDARINE SPEECH RECOGNITION
WANG, JF
WU, CH
CHANG, SH
LEE, JY
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (09) : 2141 - 2146
[38] Deep Neural Networks for Mandarin Tone Recognition
Chen, Mingming
Yang, Zhanlei
Liu, Wenju
PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 1154 - 1158
[39] ECHO: A speech recognition package for the design of robust interactive speech-based applications
Kabré H.
International Journal of Speech Technology, 1997, 2 (2) : 133 - 143
[40] Compensate the Speech Recognition Delays for Accurate Speech-Based Cursor Position Control
Tong, Qiang
Wang, Ziyun
HUMAN-COMPUTER INTERACTION, PT II, 2009, 5611 : 752 - 760

← 1 2 3 4 5 →