Discriminative-models for speech recognition

被引:0
|
作者
Gales, M. J. F. [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The vast majority of automatic speech recognition systems use Hidden Markov Models (HMMs) as the underlying acoustic model. Initially these models were trained based on the maximum likelihood criterion. Significant performance gains have been obtained by using discriminative training criteria, such as maximum mutual information and minimum phone error. However, the underlying acoustic model is still generative, with the associated constraints on the state and transition probability distributions, and classification is based on Bayes' decision rule. Recently, there has been interest in examining discriminative, or direct, models for speech recognition. This paper briefly reviews the forms of discriminative models that have been investigated. These include maximum entropy Markov models, hidden conditional random fields and conditional augmented models. The relationships between the various models and issues with applying them to large vocabulary continuous speech recognition will be discussed.
引用
收藏
页码:168 / 174
页数:7
相关论文
共 50 条
  • [31] Generalized Discriminative Feature Transformation for Speech Recognition
    Hsiao, Roger
    Schultz, Tanja
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 672 - 675
  • [32] Discriminative incorporation of explicitly trained tone models into lattice based rescoring for Mandarin speech recognition
    Huang, Hao
    Zhu, Jie
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 1541 - 1544
  • [33] Discriminative Training of n-gram Language Models for Speech Recognition via Linear Programming
    Magdin, Vladimir
    Jiang, Hui
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 305 - 310
  • [34] Chain-based Discriminative Autoencoders for Speech Recognition
    Lee, Hung-Shin
    Huang, Pin-Tuan
    Cheng, Yao-Fei
    Wang, Hsin-Min
    [J]. INTERSPEECH 2022, 2022, : 2078 - 2082
  • [35] Automatic speech recognition systems: A survey of discriminative techniques
    Kaur, Amrit Preet
    Singh, Amitoj
    Sachdeva, Rohit
    Kukreja, Vinay
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (09) : 13307 - 13339
  • [36] DISCRIMINATIVE LANGUAGE MODELING FOR SPEECH RECOGNITION WITH RELEVANCE INFORMATION
    Chen, Berlin
    Liu, Jia-Wen
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [37] Discriminative temporal feature extraction for robust speech recognition
    Shen, JL
    [J]. ELECTRONICS LETTERS, 1997, 33 (19) : 1598 - 1600
  • [38] Discriminative training of HMMs for automatic speech recognition: A survey
    Jiang, Hui
    [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 589 - 608
  • [39] Survey on discriminative feature selection for speech emotion recognition
    Xu, Xin
    Li, Ya
    Xu, Xiaoying
    Wen, Zhengqi
    Che, Hao
    Liu, Shanfeng
    Tao, Jianhua
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 345 - +
  • [40] Emotion Recognition in Speech with Latent Discriminative Representations Learning
    Han, Jing
    Zhang, Zixing
    Keren, Gil
    Schuller, Bjorn
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2018, 104 (05) : 737 - 740