Large vocabulary continuous speech recognition of an inflected language using stems and endings

被引：24

作者：

Rotovnik, Tomaz ^{[1
]}

Maucec, Mirjam Sepesy ^{[1
]}

Kacic, Zdravko ^{[1
]}

机构：

[1] Univ Maribor, Fac Elect Engn & Comp Sci, Smetanova 17, SLO-2000 Maribor, Slovenia

来源：

SPEECH COMMUNICATION | 2007年 / 49卷 / 06期

关键词：

large vocabulary continuous speech recognition; sub-word modeling; search algorithm; stem; ending;

D O I：

10.1016/j.specom.2007.02.010

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this article, we focus on creating a large vocabulary speech recognition system for the Slovenian language. Currently, state-of-heart recognition systems are able to use vocabularies with sizes of 20,000 to 100,000 words. These systems have mostly been developed for English, which belongs to a group of uninflectional languages. Slovenian, as a Slavic language, belongs to a group of inflectional languages. Its rich morphology presents a major problem in large vocabulary speech recognition. Compared to English, the Slovenian language requires a vocabulary approximately 10 times greater for the same degree of text coverage. Consequently, the difference in vocabulary size causes a high degree of OOV (out-of-vocabulary words). Therefore OOV words have a direct impact on recognizer efficiency. The characteristics of inflectional languages have been considered when developing a new search algorithm with a method for restricting the correct order of sub-word units, and to use separate language models based on sub-words. This search algorithm combines the properties of sub-word-based models (reduced OOV) and word-based models (the length of context). The algorithm also enables better search-space limitation for sub-word models. Using sub-word models, we increase recognizer accuracy and achieve a comparable search space to that of a standard word-based recognizer. Our methods were evaluated in experiments on a SNABI speech database. (C) 2007 Elsevier B.V. All rights reserved.

引用

页码：437 / 452

页数：16

共 50 条

[31] Investigation on large vocabulary continuous Kannada speech recognition
Vanajakshi, Puttaswamy Gowda
Mathivanan, M.
Kumaran, T. Senthil
[J]. INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2021, 36 (01) : 1 - 24
[32] Recent Developments in Large Vocabulary Continuous Speech Recognition
Saon, George
Chien, Jen-Tzung
[J]. 2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
[33] A Myanmar Large Vocabulary Continuous Speech Recognition System
Naing, Hay Mar Soe
Hlaing, Aye Mya
Pa, Win Pa
Hu, Xinhui
Thu, Ye Kyaw
Hori, Chiori
Kawai, Hisashi
[J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 320 - 327
[34] Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data
Wang, HM
Ho, TH
Yang, RC
Shen, JL
Bai, BR
Hong, JC
Chen, WP
Yu, TL
Lee, LS
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (02): : 195 - 200
[35] Towards speech rate independence in large vocabulary continuous speech recognition
Martinez, F
Tapias, D
Alvarez, J
[J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 725 - 728
[36] Robust spoken Language Identification using Large Vocabulary Speech Recognition.
Hieronymus, JL
Kadambe, S
[J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1111 - 1114
[37] Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition
Pakoci, Edvin
Popovic, Branislav
Pekar, Darko
[J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
[38] Large vocabulary continuous Mandarin speech recognition using finite state machine
Pan, YC
Yu, CH
Lee, LS
[J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 5 - 8
[39] Use of Gaussian Selection in large vocabulary continuous speech recognition using HMMS
Knill, KM
Gales, MJF
Young, SJ
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 470 - 473
[40] Improved Lattice Rescoring by Using Speech Attributes in Large Vocabulary Continuous Speech Recognition Systems
Gao, Xinglong
Zhang, Qingqing
Pan, Jielin
[J]. 2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 143 - 147

← 1 2 3 4 5 →