A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH

Authors
He, Lei [1]
Hao, Jie [1]
Affiliation
[1] Toshiba China Res & Dev Ctr, Beijing 100738, Peoples R China
Keywords
speech recognition; tone recognition; feature selection
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we present a tone recognition framework for continuous Mandarin speech. To model the variations in F0 patterns caused by co-articulation and phonetic effects, a set of discriminative features is extracted: 1) outlined features from the F0 contours of the target syllable and its neighboring syllables are combined; 2) contextual tone information is utilized within an iterative process; 3) phonetic information from the target and neighboring syllables is incorporated. These features are fed into a decision tree for tone classification, which follows an HMM-based toneless decoder. The results of 5-tone recognition experiments show a relative error rate reduction of more than 40% over the baseline of local outlined features. Moreover, the proposed method clearly outperforms an HMM-based tone model in speaker-independent evaluation.
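The iterative use of contextual tone information and the relative error rate reduction mentioned in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the feature names `slope` and `mean`, the thresholds, the function names, and the hand-written `toy_classifier`, which only stands in for the trained decision tree. Its single context rule borrows Mandarin third-tone sandhi (a tone 3 before another tone 3 is realized as a rising tone) as an example of how a neighbor's tone could change a decision.

```python
from typing import Callable, Dict, List, Optional

Features = Dict[str, float]
# A classifier maps (local features, previous tone, next tone) to a tone 1-5.
Classifier = Callable[[Features, Optional[int], Optional[int]], int]


def toy_classifier(f: Features, prev_tone: Optional[int],
                   next_tone: Optional[int]) -> int:
    """Hypothetical stand-in for a trained decision tree (not the paper's)."""
    if f["slope"] > 0.5:
        tone = 2          # rising contour
    elif f["slope"] < -0.5:
        tone = 4          # falling contour
    elif f["mean"] > 0.5:
        tone = 1          # high level contour
    else:
        tone = 3          # low contour
    # Illustrative context rule: third-tone sandhi before another tone 3.
    if tone == 3 and next_tone == 3:
        tone = 2
    return tone


def iterative_tone_decoding(syllables: List[Features], classify: Classifier,
                            max_iters: int = 5) -> List[int]:
    """Classify once without context, then re-classify with the neighbors'
    tones from the previous pass until the labels stop changing."""
    tones = [classify(s, None, None) for s in syllables]
    for _ in range(max_iters):
        updated = []
        for i, s in enumerate(syllables):
            prev_tone = tones[i - 1] if i > 0 else None
            next_tone = tones[i + 1] if i + 1 < len(tones) else None
            updated.append(classify(s, prev_tone, next_tone))
        if updated == tones:
            break
        tones = updated
    return tones


def relative_error_reduction(baseline_err: float, new_err: float) -> float:
    """Fraction of the baseline error rate that the new system removes."""
    return (baseline_err - new_err) / baseline_err


# Two low, flat syllables in a row (made-up feature values).
utterance = [{"slope": 0.0, "mean": 0.2}, {"slope": 0.0, "mean": 0.2}]
decoded = iterative_tone_decoding(utterance, toy_classifier)

# Made-up error rates, chosen only to show the arithmetic.
reduction = relative_error_reduction(0.20, 0.10)
```

In this toy run the first pass labels both syllables as tone 3, and the second pass relabels the first one as tone 2 once its right neighbor's tone is known. In a real system the rule-based function would be replaced by a classifier trained on F0, contextual, and phonetic features.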
Pages: 1575-1578
Page count: 4