TONE RECOGNITION OF CONTINUOUS MANDARINE SPEECH-BASED ON NEURAL NETWORKS

被引：0

作者：

CHEN, SH

WANG, YR

机构：

[1] Natl Chiao Tung Univ, Taiwan

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1995年 / 3卷 / 02期

关键词：

Number:; -; Acronym:; NSC; Sponsor: National Science Council; MOTC; Sponsor: Ministry of Transportation and Communications;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Several neural network-based tone recognition schemes for continuous Mandarin speech are discussed. A basic MLP tone recognizer using recognition features extracted from the processing syllable is first introduced. Then, some additional features extracted from neighboring syllables are added to compensate for the coarticulation effect. It is then further improved to compensate for the effect of sandhi rules of tone pronunciation by including tone information of neighboring syllables. The recognition criterion is now changed to find the best tone sequence that minimizes the total risk that simultaneously considers tone recognition of all syllables in the input utterance. Last, two approaches using HCNN and HSMLP, respectively, to model the intonation pattern as a hidden Markov chain for assisting tone recognition are proposed. The effectiveness of these schemes was confirmed by simulations on a speaker-independent tone recognition task. A recognition rate of 86.72% was achieved.

引用

页码：146 / 150

页数：5

共 50 条

[1] TONE RECOGNITION OF CONTINUOUS MANDARINE SPEECH ASSISTED WITH PROSODIC INFORMATION
WANG, YR
CHEN, SH
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (05): : 2637 - 2645
[2] Tone recognition of continuous Mandarin speech based on tone nucleus model and neural network
Wang, Xiao-Dong
Hirose, Keikichi
Zhang, Jin-Song
Minematsu, Nobuaki
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (06) : 1748 - 1755
[3] Continuous speech recognition by convolutional neural networks
Zhang, Qing-Qing
Liu, Yong
Pan, Jie-Lin
Yan, Yong-Hong
Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2015, 37 (09): : 1212 - 1217
[4] NEURAL NETWORKS FOR STATISTICAL RECOGNITION OF CONTINUOUS SPEECH
MORGAN, N
BOURLARD, HA
PROCEEDINGS OF THE IEEE, 1995, 83 (05) : 742 - 770
[5] Multicriteria Neural Network Design in the Speech-based Emotion Recognition Problem
Brester, Christina
Semenkin, Eugene
Sidorov, Maxim
Semenkina, Olga
ICIMCO 2015 PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL. 1, 2015, : 621 - 628
[6] PROLOG TO NEURAL NETWORKS FOR STATISTICAL RECOGNITION OF CONTINUOUS SPEECH
FALK, H
PROCEEDINGS OF THE IEEE, 1995, 83 (05) : 741 - 741
[7] Continuous Speech Emotion Recognition with Convolutional Neural Networks
Vryzas, Nikolaos
Vrysis, Lazaros
Matsiola, Maria
Kotsakis, Rigas
Dimoulas, Charalampos
Kalliris, George
JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2020, 68 (1-2): : 14 - 24
[8] Continuous speech emotion recognition with convolutional neural networks
Vryzas, Nikolaos
Vrysis, Lazaros
Matsiola, Maria
Kotsakis, Rigas
Dimoulas, Charalampos
Kalliris, George
AES: Journal of the Audio Engineering Society, 2020, 68 (1-2): : 14 - 24
[9] Robust Speech-Based Happiness Recognition
Lin, Chang-Hong
Siahaan, Ernestasia
Chin, Yu-Hau
Chen, Bo-Wei
Wang, Jia-Ching
Wang, Jhing-Fa
1ST INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT 2013), 2013, : 227 - 230
[10] A review of speech-based bimodal recognition
Chibelushi, CC
Deravi, F
Mason, JSD
IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (01) : 23 - 37

← 1 2 3 4 5 →