Discriminative tonal feature extraction method in mandarin speech recognition

被引:1
|
作者
HUANG Hao
机构
关键词
discriminative training; tone recognition; feature extraction; Mandarin speech recognition;
D O I
暂无
中图分类号
TN912.34 [语音识别与设备];
学科分类号
0711 ;
摘要
To utilize the supra-segmental nature of Mandarin tones, this article proposes a feature extraction method for hidden markov model (HMM) based tone modeling. The method uses linear transforms to project F0 (fundamental frequency) features of neighboring syllables as compensations, and adds them to the original F0 features of the current syllable. The transforms are discriminatively trained by using an objective function termed as "minimum tone error", which is a smooth approximation of tone recognition accuracy. Experiments show that the new tonal features achieve 3.82% tone recognition rate improvement, compared with the baseline, using maximum likelihood trained HMM on the normal F0 features. Further experiments show that discriminative HMM training on the new features is 8.78% better than the baseline.
引用
收藏
页码:126 / 130
页数:5
相关论文
共 50 条
  • [1] Robust speech recognition method based on discriminative environment feature extraction
    Jiqing Han
    Wen Gao
    [J]. Journal of Computer Science and Technology, 2001, 16 : 458 - 464
  • [2] Robust speech recognition method based on discriminative environment feature extraction
    Han, JQ
    Gao, W
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2001, 16 (05) : 458 - 464
  • [3] Robust Speech Recognition Method Based on Discriminative Environment Feature Extraction
    韩纪庆
    高文
    [J]. Journal of Computer Science & Technology, 2001, (05) : 458 - 464
  • [4] Discriminative temporal feature extraction for robust speech recognition
    Shen, JL
    [J]. ELECTRONICS LETTERS, 1997, 33 (19) : 1598 - 1600
  • [5] Information Extraction and Noisy Feature Pruning for Mandarin Speech Recognition
    Gao, Guozhi
    Duan, Zhikui
    Yang, Guangguang
    Li, Shiren
    Yu, Xinmei
    Zhao, Xiaomeng
    Ruan, Jinbiao
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2024, 72 (1-2): : 59 - 70
  • [6] Robust endpoint detection for speech recognition based on discriminative feature extraction
    Yamamoto, Koichi
    Jabloun, Firas
    Reinhard, Klaus
    Kawamura, Akinori
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 805 - 808
  • [7] Discriminative feature extraction for speech recognition using continuous output codes
    Dehzangi, Omid
    Ma, Bin
    Chng, Eng Siong
    Li, Haizhou
    [J]. PATTERN RECOGNITION LETTERS, 2012, 33 (13) : 1703 - 1709
  • [8] Discriminative transform for confidence estimation in Mandarin speech recognition
    Guo, G
    Wang, RH
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 269 - 272
  • [9] Automatic Visual Feature Extraction for Mandarin Audio-Visual Speech Recognition
    Pao, Tsang-Long
    Liao, Wen-Yuan
    Wu, Tsan-Nung
    Lin, Ching-Yi
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2936 - 2940
  • [10] Speech Emotion Recognition with Discriminative Feature Learning
    Zhou, Huan
    Liu, Kai
    [J]. INTERSPEECH 2020, 2020, : 4094 - 4097