Minimum Bayes risk estimation and decoding in large vocabulary continuous speech recognition

被引:6
|
作者
Byrne, W [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
来源
关键词
discriminative training; acoustic modeling; automatic speech recognition; maximum mutual information;
D O I
10.1093/ietisy/e89-d.3.900
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Minimum Bayes risk estimation and decoding strategies based on lattice segmentation techniques can be used to refine large vocabulary continuous speech recognition systems through the estimation of the parameters of the underlying hidden Markov models and through the identification of smaller recognition tasks which provides the opportunity to incorporate novel modeling and decoding procedures in LVCSR. These techniques are discussed in the context of going 'beyond HMMs', showing in particular that this process of subproblem identification makes it possible to train and apply small-domain binary pattern classifiers, such as Support Vector Machines, to large vocabulary continuous speech recognition.
引用
收藏
页码:900 / 907
页数:8
相关论文
共 50 条
  • [1] Lattice segmentation and minimum Bayes risk discriminative training for large vocabulary continuous speech recognition
    Doumpiotis, V
    Byrne, W
    [J]. SPEECH COMMUNICATION, 2006, 48 (02) : 142 - 160
  • [2] Hypothesis Spaces For Minimum Bayes Risk Training In Large Vocabulary Speech Recognition
    Gibson, Matthew
    Hain, Thomas
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2406 - 2409
  • [3] An overview of decoding techniques for large vocabulary continuous speech recognition
    Aubert, XL
    [J]. COMPUTER SPEECH AND LANGUAGE, 2002, 16 (01): : 89 - 114
  • [4] Segmental minimum Bayes-Risk decoding for automatic speech recognition
    Goel, V
    Kumar, S
    Byrne, W
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (03): : 234 - 249
  • [5] Discriminative training of decoding graphs for large vocabulary continuous speech recognition
    Kuo, Hong-Kwang Jeff
    Kingsbury, Brian
    Zweig, Geoffrey
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 45 - +
  • [6] Integrating induced probability into decoding for large vocabulary continuous speech recognition
    YANG Zhanlei LIU Wenju CHAO Hao (National Laboratory of Pattern Recognition
    [J]. Chinese Journal of Acoustics, 2012, 31 (03) : 338 - 352
  • [7] Support vector machines for Segmental Minimum Bayes Risk decoding of continuous speech
    Venkataramani, V
    Chakrabartty, S
    Byrne, W
    [J]. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 13 - 18
  • [8] Ginisupport vector machines for segmental minimum Bayes risk decoding of continuous speech
    Venkataramani, Veera
    Chakrabartty, Shantanu
    Byrne, William
    [J]. COMPUTER SPEECH AND LANGUAGE, 2007, 21 (03): : 423 - 442
  • [9] Response Probability Based Decoding Algorithm for Large Vocabulary Continuous Speech Recognition
    Yang, Zhanlei
    Chao, Hao
    Liu, Wenju
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1940 - 1943
  • [10] Vietnamese Large Vocabulary Continuous Speech Recognition
    Ngoc Thang Vu
    Schultz, Tanja
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 333 - 338