Discriminative models for speech recognition

Cited by: 0

Authors:

Gales, M. J. F. [1]

Affiliations:

[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England

Source:

2007 INFORMATION THEORY AND APPLICATIONS WORKSHOP | 2007

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

The vast majority of automatic speech recognition systems use Hidden Markov Models (HMMs) as the underlying acoustic model. Initially these models were trained based on the maximum likelihood criterion. Significant performance gains have been obtained by using discriminative training criteria, such as maximum mutual information and minimum phone error. However, the underlying acoustic model is still generative, with the associated constraints on the state and transition probability distributions, and classification is based on Bayes' decision rule. Recently, there has been interest in examining discriminative, or direct, models for speech recognition. This paper briefly reviews the forms of discriminative models that have been investigated. These include maximum entropy Markov models, hidden conditional random fields and conditional augmented models. The relationships between the various models and issues with applying them to large vocabulary continuous speech recognition will be discussed.


Pages: 168-174

Number of pages: 7

50 items in total

[1] Structured Discriminative Models for Speech Recognition
Gales, Mark
Watanabe, Shinji
Fosler-Lussier, Eric
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 70 - 81
[2] Structured Discriminative Models for Speech Recognition
Gales, Mark
[J]. 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012 : XXII - XXII
[3] Using SVMs and discriminative models for speech recognition
Smith, ND
Gales, MJF
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002 : 77 - 80
[4] Discriminative training of language models for speech recognition
Kuo, KHJ
Fosler-Lussier, E
Jiang, H
Lee, CH
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002 : 325 - 328
[5] Scaling Laws for Discriminative Speech Recognition Rescoring Models
Gu, Yile
Shivakumar, Prashanth Gurunath
Kolehmainen, Jari
Gandhe, Ankur
Rastrow, Ariya
Bulyko, Ivan
[J]. INTERSPEECH 2023, 2023 : 471 - 475
[6] Multi resolution discriminative models for subvocalic speech recognition
Raugas, Mark
Sridhar, Vivek Kumar Rangarajan
Prasad, Rohit
Natarajan, Prem
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010 : 2634 - 2637
[7] STRUCTURED DISCRIMINATIVE MODELS FOR NOISE ROBUST CONTINUOUS SPEECH RECOGNITION
Ragni, A.
Gales, M. J. F.
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011 : 4788 - 4791
[8] Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition
Sak, Hasim
Saraclar, Murat
Gungor, Tunga
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (08) : 2341 - 2351
[9] Speech Emotion Recognition Using Hybrid Generative and Discriminative Models
Huang, Yongming
Zhang, Guobao
Dong, Fei
Li, Yue
Da, Feipeng
[J]. PRZEGLAD ELEKTROTECHNICZNY, 2012, 88 (3B) : 105 - 108
[10] Large scale discriminative training of hidden Markov models for speech recognition
Woodland, PC
Povey, D
[J]. COMPUTER SPEECH AND LANGUAGE, 2002, 16 (01) : 25 - 47
