A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH

被引：0

作者：

He, Lei ^{[1
]}

Hao, Jie ^{[1
]}

机构：

[1] Toshiba China Res & Dev Ctr, Beijing 100738, Peoples R China

来源：

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年

关键词：

speech recognition; tone recognition; feature selection;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations of F0 pattern caused by co-articulation and phonetic effects, a set of discriminating features are extracted: 1) outlined features from the F0 contours of target syllable and neighboring syllables are combined; 2) contextual tone information is utilized within an iterative process; 3) phonetic information from target and neighboring syllables is incorporated. These features are put into a decision tree for tone classification, which follows an HMM-based toneless decoder. The results in 5-tone recognition experiments show more than 40% relative error rate reduction against the baseline local outlined features. Moreover, the proposed method obviously outperforms HMM-based tone model in speaker-independent evaluation.

引用

页码：1575 / 1578

页数：4

共 50 条

[31] An Efficient Algorithm for Syllable Hypothesization in Continuous Mandarin Speech Recognition
Huang, Eng-Fong
Wang, Hsiao-Chuan
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (03): : 446 - 449
[32] Mandarin Speech Recognition Using Convolution Neural Network with Augmented Tone Features
Hu, Xinhui
Lu, Xugang
Hori, Chiori
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 15 - 18
[33] Decision tree based mandarin tone model and its application to speech recognition
Cao, Y
Deng, YG
Zhang, H
Huang, TY
Xu, B
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1759 - 1762
[34] Tone recognition for continuous Mandarin speech with limited training data using selected context-dependent hidden Markov models
Wang, Hsin-Min
Lee, Lin-Shan
Journal of the Chinese Institute of Engineers, Transactions of the Chinese Institute of Engineers,Series A/Chung-kuo Kung Ch'eng Hsuch K'an, 1994, 17 (06): : 775 - 784
[35] Tone recognition in continuous Cantonese speech using supratone models
Qian, Yao
Lee, Tan
Soong, Frank K.
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (05): : 2936 - 2945
[36] TONE RECOGNITION OF CONTINUOUS MANDARINE SPEECH ASSISTED WITH PROSODIC INFORMATION
WANG, YR
CHEN, SH
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (05): : 2637 - 2645
[37] Tone Recognition of Isolated Mandarin Syllables
Xie, Zhaoqiang
Miao, Zhenjiang
IMAGE AND SIGNAL PROCESSING, PROCEEDINGS, 2010, 6134 : 412 - 418
[38] Incorporating Tone Features to Convolutional Neural Network to Improve Mandarin/Thai Speech Recognition
Hu, Xinhui
Saiko, Masahiro
Hori, Chiori
2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
[39] Continuous mandarin speech recognition using hierarchical recurrent neural networks
Liao, YF
Chen, WY
Chen, SH
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 3370 - 3373
[40] Visual information assisted mandarin large vocabulary continuous speech recognition
Liu, P
Wang, ZY
2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 72 - 77

← 1 2 3 4 5 →