Minimum Bayes risk estimation and decoding in large vocabulary continuous speech recognition

被引:6
|
作者
Byrne, W [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
来源
关键词
discriminative training; acoustic modeling; automatic speech recognition; maximum mutual information;
D O I
10.1093/ietisy/e89-d.3.900
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Minimum Bayes risk estimation and decoding strategies based on lattice segmentation techniques can be used to refine large vocabulary continuous speech recognition systems through the estimation of the parameters of the underlying hidden Markov models and through the identification of smaller recognition tasks which provides the opportunity to incorporate novel modeling and decoding procedures in LVCSR. These techniques are discussed in the context of going 'beyond HMMs', showing in particular that this process of subproblem identification makes it possible to train and apply small-domain binary pattern classifiers, such as Support Vector Machines, to large vocabulary continuous speech recognition.
引用
收藏
页码:900 / 907
页数:8
相关论文
共 50 条
  • [41] DISTRIBUTED SUBMODULAR MAXIMIZATION FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
    Qi, Jun
    Liu, Xu
    Kamijo, Shunshuke
    Tejedor, Javier
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2501 - 2505
  • [42] A large vocabulary continuous speech recognition system for Persian language
    Hossein Sameti
    Hadi Veisi
    Mohammad Bahrani
    Bagher Babaali
    Khosro Hosseinzadeh
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2011
  • [43] A word graph algorithm for large vocabulary continuous speech recognition
    Ortmanns, S
    Ney, H
    Aubert, X
    [J]. COMPUTER SPEECH AND LANGUAGE, 1997, 11 (01): : 43 - 72
  • [44] Large Vocabulary Continuous Audio-Visual Speech Recognition
    Sterpu, George
    [J]. ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018, : 538 - 541
  • [45] On designing pronunciation lexicons for large vocabulary, continuous speech recognition
    Lamel, L
    Adda, G
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 6 - 9
  • [46] Large-Vocabulary Continuous Speech Recognition of Lhasa Tibetan
    Li, Guanyu
    Yu, Hongzhi
    [J]. COMPUTER AND INFORMATION TECHNOLOGY, 2014, 519-520 : 802 - 806
  • [47] DEEP-FSMN FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
    Zhang, Shiliang
    Lei, Ming
    Yan, Zhijie
    Dai, Lirong
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5869 - 5873
  • [48] Large-vocabulary continuous speech recognition: Advances and applications
    Gauvain, JL
    Lamel, L
    [J]. PROCEEDINGS OF THE IEEE, 2000, 88 (08) : 1181 - 1200
  • [49] A large-vocabulary continuous speech recognition system for Hindi
    Kumar, M
    Rajput, N
    Verma, A
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2004, 48 (5-6) : 703 - 715
  • [50] Phone deactivation pruning in large vocabulary continuous speech recognition
    Renals, S
    [J]. IEEE SIGNAL PROCESSING LETTERS, 1996, 3 (01) : 4 - 6