A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH

被引：0

作者：

He, Lei ^{[1
]}

Hao, Jie ^{[1
]}

机构：

[1] Toshiba China Res & Dev Ctr, Beijing 100738, Peoples R China

来源：

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年

关键词：

speech recognition; tone recognition; feature selection;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations of F0 pattern caused by co-articulation and phonetic effects, a set of discriminating features are extracted: 1) outlined features from the F0 contours of target syllable and neighboring syllables are combined; 2) contextual tone information is utilized within an iterative process; 3) phonetic information from target and neighboring syllables is incorporated. These features are put into a decision tree for tone classification, which follows an HMM-based toneless decoder. The results in 5-tone recognition experiments show more than 40% relative error rate reduction against the baseline local outlined features. Moreover, the proposed method obviously outperforms HMM-based tone model in speaker-independent evaluation.

引用

下载

页码：1575 / 1578

页数：4

共 50 条

[21] IMPROVED TONE MODELING BY EXPLOITING ARTICULATORY FEATURES FOR MANDARIN SPEECH RECOGNITION
Chao, Hao
Yang, Zhanlei
Liu, Wenju
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4741 - 4744
[22] Automatic context induction for tone model integration in mandarin speech recognition
HUANG Hao1
The Journal of China Universities of Posts and Telecommunications, 2012, (01) : 94 - 100
[23] Real context model for tone recognition in mandarin conversational telephone speech
Liu, Zhaojie
Shao, Jian
Zhang, Pengyuan
Zhao, Qingwei
Yan, Yonghong
Feng, Ji
ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2007, : 696 - +
[24] Automatic context induction for tone model integration in mandarin speech recognition
HUANG HaoLI Binghu Department of Information Science and EngineeringXinjiang UniversityUrumqi China Laboratory of MultiLingual Information TechnologyXinjiang UniversityUrumqi China
TheJournalofChinaUniversitiesofPostsandTelecommunications, 2012, 19 (01) : 94 - 100
[25] Use tone detection to improve performance of mandarin digit speech recognition
Tsinghua Univ, Beijing, China
Qinghua Daxue Xuebao/Journal of Tsinghua University, 1998, 38 (09): : 36 - 39
[26] Improved mandarin speech recognition by lattice rescoring with enhanced tone models
Wang, Huanliang
Qian, Yao
Soong, Frank
Zhou, Jian-Lai
Han, Jiqing
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 445 - +
[27] Landmark-Guided Segmental Speech Decoding for Continuous Mandarin Speech Recognition
Chao, Hao
Song, Cheng
JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2016, 12 (03): : 410 - 421
[28] Novel Extended Phonemic Set for Mandarin Continuous Speech Recognition
谢湘
匡镜明
Journal of Beijing Institute of Technology, 2003, (04) : 399 - 402
[29] Feature selection in mandarin large vocabulary continuous speech recognition
Zhu, X
Chen, YN
Liu, J
Liu, RS
2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 508 - 511
[30] An MRNN-based method for continuous Mandarin speech recognition
Liao, YF
Chen, SH
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1121 - 1124

← 1 2 3 4 5 →