Discriminative models for speech recognition

Author
Gales, M. J. F. [1 ]
Affiliation
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
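Keywords
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract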
The vast majority of automatic speech recognition systems use Hidden Markov Models (HMMs) as the underlying acoustic model. Initially these models were trained based on the maximum likelihood criterion. Significant performance gains have been obtained by using discriminative training criteria, such as maximum mutual information and minimum phone error. However, the underlying acoustic model is still generative, with the associated constraints on the state and transition probability distributions, and classification is based on Bayes' decision rule. Recently, there has been interest in examining discriminative, or direct, models for speech recognition. This paper briefly reviews the forms of discriminative models that have been investigated. These include maximum entropy Markov models, hidden conditional random fields and conditional augmented models. The relationships between the various models and issues with applying them to large vocabulary continuous speech recognition will be discussed.
Pages: 168-174
Page count: 7