Large vocabulary continuous speech recognition of an inflected language using stems and endings

被引：24

作者：

Rotovnik, Tomaz ^{[1
]}

Maucec, Mirjam Sepesy ^{[1
]}

Kacic, Zdravko ^{[1
]}

机构：

[1] Univ Maribor, Fac Elect Engn & Comp Sci, Smetanova 17, SLO-2000 Maribor, Slovenia

来源：

SPEECH COMMUNICATION | 2007年 / 49卷 / 06期

关键词：

large vocabulary continuous speech recognition; sub-word modeling; search algorithm; stem; ending;

D O I：

10.1016/j.specom.2007.02.010

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this article, we focus on creating a large vocabulary speech recognition system for the Slovenian language. Currently, state-of-heart recognition systems are able to use vocabularies with sizes of 20,000 to 100,000 words. These systems have mostly been developed for English, which belongs to a group of uninflectional languages. Slovenian, as a Slavic language, belongs to a group of inflectional languages. Its rich morphology presents a major problem in large vocabulary speech recognition. Compared to English, the Slovenian language requires a vocabulary approximately 10 times greater for the same degree of text coverage. Consequently, the difference in vocabulary size causes a high degree of OOV (out-of-vocabulary words). Therefore OOV words have a direct impact on recognizer efficiency. The characteristics of inflectional languages have been considered when developing a new search algorithm with a method for restricting the correct order of sub-word units, and to use separate language models based on sub-words. This search algorithm combines the properties of sub-word-based models (reduced OOV) and word-based models (the length of context). The algorithm also enables better search-space limitation for sub-word models. Using sub-word models, we increase recognizer accuracy and achieve a comparable search space to that of a standard word-based recognizer. Our methods were evaluated in experiments on a SNABI speech database. (C) 2007 Elsevier B.V. All rights reserved.

引用

页码：437 / 452

页数：16

共 50 条

[1] Automatic language identification using large vocabulary continuous speech recognition
Mendoza, S
Gillick, L
Ito, Y
Lowe, S
Newmann, M
[J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 785 - 788
[2] A large vocabulary continuous speech recognition system for Persian language
Sameti, Hossein
Veisi, Hadi
Bahrani, Mohammad
Babaali, Bagher
Hosseinzadeh, Khosro
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 12
[3] A large vocabulary continuous speech recognition system for Persian language
Hossein Sameti
Hadi Veisi
Mohammad Bahrani
Bagher Babaali
Khosro Hosseinzadeh
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2011
[4] Connectionist language modeling for large vocabulary continuous speech recognition
Schwenk, H
Gauvain, JL
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 765 - 768
[5] A unified language model for large vocabulary continuous speech recognition of Turkish
Arisoy, Ebru
Dutagaci, Helin
Arslan, Levent M.
[J]. SIGNAL PROCESSING, 2006, 86 (10) : 2844 - 2862
[6] Using a transcription graph for large vocabulary continuous speech recognition
Li, Z
OShaughnessy, D
[J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 121 - 124
[7] Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish
Vanhainen, Niklas
Salvi, Giampiero
[J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
[8] Advances in large vocabulary continuous speech recognition
Zweig, G
Picheny, M
[J]. ADVANCES IN COMPUTERS, VOL. 60: INFORMATION SECURITY, 2004, 60 : 249 - 291
[9] Vietnamese Large Vocabulary Continuous Speech Recognition
Ngoc Thang Vu
Schultz, Tanja
[J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 333 - 338
[10] Syllable Based Language Model for Large Vocabulary Continuous Speech Recognition of Polish
Majewski, Piotr
[J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 397 - 401

← 1 2 3 4 5 →