Large vocabulary continuous speech recognition of an inflected language using stems and endings

被引:24
|
作者
Rotovnik, Tomaz [1 ]
Maucec, Mirjam Sepesy [1 ]
Kacic, Zdravko [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, Smetanova 17, SLO-2000 Maribor, Slovenia
关键词
large vocabulary continuous speech recognition; sub-word modeling; search algorithm; stem; ending;
D O I
10.1016/j.specom.2007.02.010
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this article, we focus on creating a large vocabulary speech recognition system for the Slovenian language. Currently, state-of-heart recognition systems are able to use vocabularies with sizes of 20,000 to 100,000 words. These systems have mostly been developed for English, which belongs to a group of uninflectional languages. Slovenian, as a Slavic language, belongs to a group of inflectional languages. Its rich morphology presents a major problem in large vocabulary speech recognition. Compared to English, the Slovenian language requires a vocabulary approximately 10 times greater for the same degree of text coverage. Consequently, the difference in vocabulary size causes a high degree of OOV (out-of-vocabulary words). Therefore OOV words have a direct impact on recognizer efficiency. The characteristics of inflectional languages have been considered when developing a new search algorithm with a method for restricting the correct order of sub-word units, and to use separate language models based on sub-words. This search algorithm combines the properties of sub-word-based models (reduced OOV) and word-based models (the length of context). The algorithm also enables better search-space limitation for sub-word models. Using sub-word models, we increase recognizer accuracy and achieve a comparable search space to that of a standard word-based recognizer. Our methods were evaluated in experiments on a SNABI speech database. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:437 / 452
页数:16
相关论文
共 50 条
  • [31] Investigation on large vocabulary continuous Kannada speech recognition
    Vanajakshi, Puttaswamy Gowda
    Mathivanan, M.
    Kumaran, T. Senthil
    [J]. INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2021, 36 (01) : 1 - 24
  • [32] Recent Developments in Large Vocabulary Continuous Speech Recognition
    Saon, George
    Chien, Jen-Tzung
    [J]. 2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [33] A Myanmar Large Vocabulary Continuous Speech Recognition System
    Naing, Hay Mar Soe
    Hlaing, Aye Mya
    Pa, Win Pa
    Hu, Xinhui
    Thu, Ye Kyaw
    Hori, Chiori
    Kawai, Hisashi
    [J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 320 - 327
  • [34] Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data
    Wang, HM
    Ho, TH
    Yang, RC
    Shen, JL
    Bai, BR
    Hong, JC
    Chen, WP
    Yu, TL
    Lee, LS
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (02): : 195 - 200
  • [35] Towards speech rate independence in large vocabulary continuous speech recognition
    Martinez, F
    Tapias, D
    Alvarez, J
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 725 - 728
  • [36] Robust spoken Language Identification using Large Vocabulary Speech Recognition.
    Hieronymus, JL
    Kadambe, S
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1111 - 1114
  • [37] Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition
    Pakoci, Edvin
    Popovic, Branislav
    Pekar, Darko
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
  • [38] Large vocabulary continuous Mandarin speech recognition using finite state machine
    Pan, YC
    Yu, CH
    Lee, LS
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 5 - 8
  • [39] Use of Gaussian Selection in large vocabulary continuous speech recognition using HMMS
    Knill, KM
    Gales, MJF
    Young, SJ
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 470 - 473
  • [40] Improved Lattice Rescoring by Using Speech Attributes in Large Vocabulary Continuous Speech Recognition Systems
    Gao, Xinglong
    Zhang, Qingqing
    Pan, Jielin
    [J]. 2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 143 - 147