Statistical language modeling with semantic classes for large vocabulary speech recognition in embedded systems

被引：0

作者：

Oria, Daniela ^{[1
]}

Olsen, Jesper ^{[1
]}

机构：

[1] Nokia Res Ctr, Itamerenkatu 11-13, FIN-00180 Helsinki, Finland

来源：

PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE | 2006年

关键词：

language modelling; dictation; embedded ASR; large vocabulary ASR;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we investigate the use of semantic classes in an n-gram language model used for speech dictation in an embedded environment. The alternative to using semantic classes is to use automatically clustered word classes, but with this approach it is often difficult to increase the modeling accuracy and reduce the model size at the same time - two factors that are critical for an embedded application. We describe how the introduction of semantic classes in a large vocabulary (33000 words) embedded dictation task for US English reduced the model size by 16.0%, while at the same time also reducing the word error rate by 12.0% relatively.

引用

下载

页码：496 / +

页数：2

共 50 条

[1] Large vocabulary Russian speech recognition using syntactico-statistical language modeling
Karpov, Alexey
Markov, Konstantin
Kipyatkova, Irina
Vazhenina, Dania
Ronzhin, Andrey
SPEECH COMMUNICATION, 2014, 56 : 213 - 228
[2] Large vocabulary speech recognition with multispan statistical language models
Bellegarda, JR
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (01): : 76 - 84
[3] A multispan language modeling framework for large vocabulary speech recognition
Bellegarda, JR
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (05): : 456 - 467
[4] Connectionist language modeling for large vocabulary continuous speech recognition
Schwenk, H
Gauvain, JL
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 765 - 768
[5] Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition
Pakoci, Edvin
Popovic, Branislav
Pekar, Darko
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
[6] SPEECH RECOGNITION FOR LARGE-VOCABULARY SYSTEMS
JACOB, B
ANDREOBRECHT, R
JOURNAL DE PHYSIQUE IV, 1994, 4 (C5): : 489 - 492
[7] Latent semantic language modeling for speech recognition
Bellegarda, JR
MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 73 - 103
[8] Subspace Gaussian mixture based language modeling for large vocabulary continuous speech recognition
Sun, Ri Hyon
Chol, Ri Jong
SPEECH COMMUNICATION, 2020, 117 : 21 - 27
[9] A tutorial on pronunciation modeling for large vocabulary speech recognition
Fosler-Lussier, E
TEXT- AND SPEECH-TRIGGERED INFORMATION ACCESS, 2003, 2705 : 38 - 77
[10] Prosodic Modeling in Large Vocabulary Mandarin Speech Recognition
Huang, Jui-Ting
Lee, Lin-shan
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1241 - 1244

← 1 2 3 4 5 →