Tone modeling based on hidden conditional random fields and discriminative model weight training

被引:0
|
作者
Department of Electronic Engineering, Shanghai Jiaotong University, Shanghai 200240, China [1 ]
机构
来源
Trans. Nanjing Univ. Aero. Astro. | 2008年 / 1卷 / 43-49期
关键词
Discriminative model weight training (DMWT) - Hidden conditional random fields (HCRFs) - Large vocabulary continuous speech recognition (LVCSR) - Minimum phone error (MPE) - Tone recognition;
D O I
暂无
中图分类号
学科分类号
摘要
The use of hidden conditional random fields (HCRFs) for tone modeling is explored. The tone recognition performance is improved using HCRFs by taking advantage of intra-syllable dynamic, inter-syllable dynamic and duration features. When the tone model is integrated into continuous speech recognition, the discriminative model weight training (DMWT) is proposed. Acoustic and tone scores are scaled by model weights discrimina-tively trained by the minimum phone error (MPE) criterion. Two schemes of weight training are evaluated and a smoothing technique is used to make training robust to overtraining problem. Experiments show that the accuracies of tone recognition and large vocabulary continuous speech recognition (LVCSR) can be improved by the HCRFs based tone model. Compared with the global weight scheme, continuous speech recognition can be improved by the discriminative trained weight combinations.
引用
下载
收藏
页码:43 / 49
相关论文
共 50 条
  • [1] Training algorithms for hidden conditional random fields
    Mahajan, Milind
    Gunawardana, Asela
    Acero, Alex
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 273 - 276
  • [2] Acoustic Features for Hidden Conditional Random Fields-Based Thai Tone Classification
    Kertkeidkachorn, Natthawut
    Punyabukkana, Proadpran
    Suchato, Atiwong
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2016, 15 (02)
  • [3] Discriminative Training of Conditional Random Fields with Probably Submodular Constraints
    Maxim Berman
    Matthew B. Blaschko
    International Journal of Computer Vision, 2020, 128 : 1722 - 1735
  • [4] Minimum tag error for discriminative training of conditional random fields
    Xiong, Ying
    Zhu, Jie
    Huang, Hao
    Xu, Haihua
    INFORMATION SCIENCES, 2009, 179 (1-2) : 169 - 179
  • [5] Discriminative Training of Conditional Random Fields with Probably Submodular Constraints
    Berman, Maxim
    Blaschko, Matthew B.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (06) : 1722 - 1735
  • [6] Modeling Broad Context for Tone Recognition with Conditional Random Fields
    Wang, Siwei
    Levow, Gina-Anne
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2300 - +
  • [7] Tone model integration based on discriminative weight training for Putonghua speech recognition
    HUANG Hao ZHU Jie (Department of Electronic Engineering
    Chinese Journal of Acoustics, 2008, (03) : 193 - 202
  • [8] Hidden conditional random fields
    Quattoni, Ariadna
    Wang, Sybor
    Morency, Louis-Philippe
    Collins, Michael
    Darrell, Trevor
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (10) : 1848 - 1853
  • [9] DISCRIMINATIVE DURATION MODELING FOR SPEECH RECOGNITION WITH SEGMENTAL CONDITIONAL RANDOM FIELDS
    Kao, Justine T.
    Zweig, Geoffrey
    Nguyen, Patrick
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4476 - 4479
  • [10] Discriminative Word Alignment with Conditional Random Fields
    Blunsom, Phil
    Cohn, Trevor
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 65 - 72