A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH

被引:0
|
作者
He, Lei [1 ]
Hao, Jie [1 ]
机构
[1] Toshiba China Res & Dev Ctr, Beijing 100738, Peoples R China
关键词
speech recognition; tone recognition; feature selection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations of F0 pattern caused by co-articulation and phonetic effects, a set of discriminating features are extracted: 1) outlined features from the F0 contours of target syllable and neighboring syllables are combined; 2) contextual tone information is utilized within an iterative process; 3) phonetic information from target and neighboring syllables is incorporated. These features are put into a decision tree for tone classification, which follows an HMM-based toneless decoder. The results in 5-tone recognition experiments show more than 40% relative error rate reduction against the baseline local outlined features. Moreover, the proposed method obviously outperforms HMM-based tone model in speaker-independent evaluation.
引用
下载
收藏
页码:1575 / 1578
页数:4
相关论文
共 50 条
  • [21] IMPROVED TONE MODELING BY EXPLOITING ARTICULATORY FEATURES FOR MANDARIN SPEECH RECOGNITION
    Chao, Hao
    Yang, Zhanlei
    Liu, Wenju
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4741 - 4744
  • [23] Real context model for tone recognition in mandarin conversational telephone speech
    Liu, Zhaojie
    Shao, Jian
    Zhang, Pengyuan
    Zhao, Qingwei
    Yan, Yonghong
    Feng, Ji
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2007, : 696 - +
  • [24] Automatic context induction for tone model integration in mandarin speech recognition
    HUANG HaoLI Binghu Department of Information Science and EngineeringXinjiang UniversityUrumqi China Laboratory of MultiLingual Information TechnologyXinjiang UniversityUrumqi China
    TheJournalofChinaUniversitiesofPostsandTelecommunications, 2012, 19 (01) : 94 - 100
  • [25] Use tone detection to improve performance of mandarin digit speech recognition
    Tsinghua Univ, Beijing, China
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 1998, 38 (09): : 36 - 39
  • [26] Improved mandarin speech recognition by lattice rescoring with enhanced tone models
    Wang, Huanliang
    Qian, Yao
    Soong, Frank
    Zhou, Jian-Lai
    Han, Jiqing
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 445 - +
  • [27] Landmark-Guided Segmental Speech Decoding for Continuous Mandarin Speech Recognition
    Chao, Hao
    Song, Cheng
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2016, 12 (03): : 410 - 421
  • [28] Novel Extended Phonemic Set for Mandarin Continuous Speech Recognition
    谢湘
    匡镜明
    Journal of Beijing Institute of Technology, 2003, (04) : 399 - 402
  • [29] Feature selection in mandarin large vocabulary continuous speech recognition
    Zhu, X
    Chen, YN
    Liu, J
    Liu, RS
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 508 - 511
  • [30] An MRNN-based method for continuous Mandarin speech recognition
    Liao, YF
    Chen, SH
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1121 - 1124