A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH

Cited by: 0
Authors
He, Lei [1]
Hao, Jie [1]
Affiliations
[1] Toshiba China Res & Dev Ctr, Beijing 100738, Peoples R China
Keywords
speech recognition; tone recognition; feature selection
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations in F0 patterns caused by co-articulation and phonetic effects, a set of discriminating features is extracted: 1) outlined features from the F0 contours of the target syllable and its neighboring syllables are combined; 2) contextual tone information is utilized within an iterative process; 3) phonetic information from the target and neighboring syllables is incorporated. These features are fed into a decision tree for tone classification, which runs after an HMM-based toneless decoder. In 5-tone recognition experiments, the framework achieves a relative error rate reduction of more than 40% over the baseline of local outlined features. Moreover, the proposed method clearly outperforms an HMM-based tone model in speaker-independent evaluation.
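The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the choice of outline points, the thresholds, and the hand-written rules standing in for a trained decision tree are all assumptions made for illustration, and the phonetic features and the HMM-based toneless decoder are omitted. It shows only how outlined F0 features of a target syllable and its neighbors might be combined, and how neighboring tone hypotheses might be refined over several passes.

```python
from statistics import mean


def outline_features(f0):
    """Summarize one syllable's F0 contour (Hz) with a few outline values.

    The choice of start/end thirds is an assumption, not the paper's definition.
    """
    third = max(1, len(f0) // 3)
    start = mean(f0[:third])
    end = mean(f0[-third:])
    return {"mean": mean(f0), "start": start, "end": end, "slope": end - start}


def build_features(contours, i, tones):
    """Combine target-syllable features with neighbor features and tone guesses."""
    feats = {f"cur_{k}": v for k, v in outline_features(contours[i]).items()}
    for name, j in (("prev", i - 1), ("next", i + 1)):
        inside = 0 <= j < len(contours)
        if inside:
            for k, v in outline_features(contours[j]).items():
                feats[f"{name}_{k}"] = v
        # Contextual tone information: the current hypothesis for the neighbor.
        feats[f"{name}_tone"] = tones[j] if inside else None
    return feats


def toy_classifier(feats):
    """Hypothetical hand-written rules standing in for a trained decision tree."""
    slope = feats["cur_slope"]
    if slope > 15:
        return 2  # rising contour
    if slope < -15:
        return 4  # falling contour
    if feats["cur_mean"] >= 220:
        return 1  # high level contour
    if feats["cur_mean"] <= 160:
        return 3  # low contour
    # Ambiguous level contour: fall back on the previous tone hypothesis
    # (an arbitrary illustrative rule, not a finding from the paper).
    return 3 if feats["prev_tone"] == 4 else 1


def recognize_tones(contours, classify, max_iters=5):
    """Re-classify every syllable until the tone sequence stops changing."""
    tones = [None] * len(contours)
    for _ in range(max_iters):
        new = [classify(build_features(contours, i, tones))
               for i in range(len(contours))]
        if new == tones:
            break
        tones = new
    return tones


contours = [[200, 210, 230, 250], [260, 230, 200, 170], [190, 190, 190, 190]]
print(recognize_tones(contours, toy_classifier))  # [2, 4, 3]
```

In this toy run the third syllable is first labeled tone 1 because no context is available yet, then relabeled tone 3 on the second pass once its left neighbor has been hypothesized as tone 4. A real system would replace `toy_classifier` with a decision tree trained on labeled data.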
Pages: 1575-1578
Number of pages: 4
Related papers
50 in total
  • [31] An Efficient Algorithm for Syllable Hypothesization in Continuous Mandarin Speech Recognition
    Huang, Eng-Fong
    Wang, Hsiao-Chuan
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (03): 446-449
  • [32] Mandarin Speech Recognition Using Convolution Neural Network with Augmented Tone Features
    Hu, Xinhui
    Lu, Xugang
    Hori, Chiori
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014: 15-18
  • [33] Decision tree based mandarin tone model and its application to speech recognition
    Cao, Y
    Deng, YG
    Zhang, H
    Huang, TY
    Xu, B
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000: 1759-1762
  • [34] Tone recognition for continuous Mandarin speech with limited training data using selected context-dependent hidden Markov models
    Wang, Hsin-Min
    Lee, Lin-Shan
    Journal of the Chinese Institute of Engineers, Transactions of the Chinese Institute of Engineers, Series A/Chung-kuo Kung Ch'eng Hsueh K'an, 1994, 17 (06): 775-784
  • [35] Tone recognition in continuous Cantonese speech using supratone models
    Qian, Yao
    Lee, Tan
    Soong, Frank K.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (05): 2936-2945
  • [36] TONE RECOGNITION OF CONTINUOUS MANDARIN SPEECH ASSISTED WITH PROSODIC INFORMATION
    WANG, YR
    CHEN, SH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (05): 2637-2645
  • [37] Tone Recognition of Isolated Mandarin Syllables
    Xie, Zhaoqiang
    Miao, Zhenjiang
    IMAGE AND SIGNAL PROCESSING, PROCEEDINGS, 2010, 6134: 412-418
  • [38] Incorporating Tone Features to Convolutional Neural Network to Improve Mandarin/Thai Speech Recognition
    Hu, Xinhui
    Saiko, Masahiro
    Hori, Chiori
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014
  • [39] Continuous mandarin speech recognition using hierarchical recurrent neural networks
    Liao, YF
    Chen, WY
    Chen, SH
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996: 3370-3373
  • [40] Visual information assisted mandarin large vocabulary continuous speech recognition
    Liu, P
    Wang, ZY
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003: 72-77