Statistical language modeling with semantic classes for large vocabulary speech recognition in embedded systems

被引:0
|
作者
Oria, Daniela [1 ]
Olsen, Jesper [1 ]
机构
[1] Nokia Res Ctr, Itamerenkatu 11-13, FIN-00180 Helsinki, Finland
关键词
language modelling; dictation; embedded ASR; large vocabulary ASR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we investigate the use of semantic classes in an n-gram language model used for speech dictation in an embedded environment. The alternative to using semantic classes is to use automatically clustered word classes, but with this approach it is often difficult to increase the modeling accuracy and reduce the model size at the same time - two factors that are critical for an embedded application. We describe how the introduction of semantic classes in a large vocabulary (33000 words) embedded dictation task for US English reduced the model size by 16.0%, while at the same time also reducing the word error rate by 12.0% relatively.
引用
收藏
页码:496 / +
页数:2
相关论文
共 50 条
  • [1] Large vocabulary Russian speech recognition using syntactico-statistical language modeling
    Karpov, Alexey
    Markov, Konstantin
    Kipyatkova, Irina
    Vazhenina, Dania
    Ronzhin, Andrey
    [J]. SPEECH COMMUNICATION, 2014, 56 : 213 - 228
  • [2] Large vocabulary speech recognition with multispan statistical language models
    Bellegarda, JR
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (01): : 76 - 84
  • [3] A multispan language modeling framework for large vocabulary speech recognition
    Bellegarda, JR
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (05): : 456 - 467
  • [4] Connectionist language modeling for large vocabulary continuous speech recognition
    Schwenk, H
    Gauvain, JL
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 765 - 768
  • [5] Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition
    Pakoci, Edvin
    Popovic, Branislav
    Pekar, Darko
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
  • [6] SPEECH RECOGNITION FOR LARGE-VOCABULARY SYSTEMS
    JACOB, B
    ANDREOBRECHT, R
    [J]. JOURNAL DE PHYSIQUE IV, 1994, 4 (C5): : 489 - 492
  • [7] Latent semantic language modeling for speech recognition
    Bellegarda, JR
    [J]. MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 73 - 103
  • [8] Subspace Gaussian mixture based language modeling for large vocabulary continuous speech recognition
    Sun, Ri Hyon
    Chol, Ri Jong
    [J]. SPEECH COMMUNICATION, 2020, 117 : 21 - 27
  • [9] A tutorial on pronunciation modeling for large vocabulary speech recognition
    Fosler-Lussier, E
    [J]. TEXT- AND SPEECH-TRIGGERED INFORMATION ACCESS, 2003, 2705 : 38 - 77
  • [10] Prosodic Modeling in Large Vocabulary Mandarin Speech Recognition
    Huang, Jui-Ting
    Lee, Lin-shan
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1241 - 1244