Discriminative models for speech recognition

Cited by: 0
Author
Gales, M. J. F. [1]
Affiliation
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The vast majority of automatic speech recognition systems use Hidden Markov Models (HMMs) as the underlying acoustic model. Initially these models were trained based on the maximum likelihood criterion. Significant performance gains have been obtained by using discriminative training criteria, such as maximum mutual information and minimum phone error. However, the underlying acoustic model is still generative, with the associated constraints on the state and transition probability distributions, and classification is based on Bayes' decision rule. Recently, there has been interest in examining discriminative, or direct, models for speech recognition. This paper briefly reviews the forms of discriminative models that have been investigated. These include maximum entropy Markov models, hidden conditional random fields and conditional augmented models. The relationships between the various models and issues with applying them to large vocabulary continuous speech recognition will be discussed.
Pages: 168 - 174
Number of pages: 7
Related papers
50 in total
  • [1] Structured Discriminative Models for Speech Recognition
    Gales, Mark
    Watanabe, Shinji
    Fosler-Lussier, Eric
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 70 - 81
  • [2] Structured Discriminative Models for Speech Recognition
    Gales, Mark
    [J]. 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : XXII - XXII
  • [3] Using SVMs and discriminative models for speech recognition
    Smith, ND
    Gales, MJF
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 77 - 80
  • [4] Discriminative training of language models for speech recognition
    Kuo, KHJ
    Fosler-Lussier, E
    Jiang, H
    Lee, CH
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 325 - 328
  • [5] Scaling Laws for Discriminative Speech Recognition Rescoring Models
    Gu, Yile
    Shivakumar, Prashanth Gurunath
    Kolehmainen, Jari
    Gandhe, Ankur
    Rastrow, Ariya
    Bulyko, Ivan
    [J]. INTERSPEECH 2023, 2023, : 471 - 475
  • [6] Multi resolution discriminative models for subvocalic speech recognition
    Raugas, Mark
    Sridhar, Vivek Kumar Rangarajan
    Prasad, Rohit
    Natarajan, Prem
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2634 - 2637
  • [7] STRUCTURED DISCRIMINATIVE MODELS FOR NOISE ROBUST CONTINUOUS SPEECH RECOGNITION
    Ragni, A.
    Gales, M. J. F.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4788 - 4791
  • [8] Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition
    Sak, Hasim
    Saraclar, Murat
    Gungor, Tunga
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (08) : 2341 - 2351
  • [9] Speech Emotion Recognition Using Hybrid Generative and Discriminative Models
    Huang, Yongming
    Zhang, Guobao
    Dong, Fei
    Li, Yue
    Da, Feipeng
    [J]. PRZEGLAD ELEKTROTECHNICZNY, 2012, 88 (3B) : 105 - 108
  • [10] Large scale discriminative training of hidden Markov models for speech recognition
    Woodland, PC
    Povey, D
    [J]. COMPUTER SPEECH AND LANGUAGE, 2002, 16 (01) : 25 - 47