Speech and Language Resources for LVCSR of Russian

被引:0
|
作者
Zablotskiy, Sergey [1 ]
Shvets, Alexander [2 ]
Sidorov, Maxim [3 ]
Semenkin, Eugene [3 ]
Minker, Wolfgang [1 ]
机构
[1] Univ Ulm, Inst Commun Engn, Ulm, Germany
[2] RAS, Inst Syst Anal, Moscow, Russia
[3] Siberian State Aerosp Univ, Insitute Syst Anal, Krasnoyarsk, Russia
关键词
LVCSR; Russian; language modelling; sub-word units;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
A syllable-based language model reduces the lexicon size by hundreds of times. It is especially beneficial in case of highly inflective languages like Russian due to the abundance of word forms according to various grammatical categories. However, the main arising challenge is the concatenation of recognised syllables into the originally spoken sentence or phrase, particularly in the presence of syllable recognition mistakes. Natural fluent speech does not usually incorporate clear information about the outside borders of the spoken words. In this paper a method for the syllable concatenation and error correction is suggested and tested. It is based on the designed co-evolutionary asymptotic probabilistic genetic algorithm for the determination of the most likely sentence corresponding to the recognized chain of syllables within an acceptable time frame. The advantage of this genetic algorithm modification is the minimum number of settings to be manually adjusted comparing to the standard algorithm. Data used for acoustic and language modelling are also described here. A special issue is the preprocessing of the textual data, particularly, handling of abbreviations, Arabic and Roman numerals, since their inflection mostly depends on the context and grammar.
引用
收藏
页码:3374 / 3377
页数:4
相关论文
共 50 条
  • [1] Speech Resources for a Serbian LVCSR System
    Ostrogonac, Stevan
    Suzic, Sinisa
    Bojanic, Milana
    Pakoci, Edvin
    [J]. 2013 21ST TELECOMMUNICATIONS FORUM (TELFOR), 2013, : 478 - +
  • [2] Factored Language Modeling for Russian LVCSR
    Vazhenina, Daria
    Markov, Konstantin
    [J]. 2013 INTERNATIONAL JOINT CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY & UBI-MEDIA COMPUTING (ICAST-UMEDIA), 2013, : 205 - 210
  • [3] Sub-word Language Modeling for Russian LVCSR
    Zablotskiy, Sergey
    Minker, Wolfgang
    [J]. SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 413 - 421
  • [4] NIST speech processing evaluations: LVCSR, speaker recognition, language recognition
    Martin, Alvin F.
    Garofolo, John S.
    [J]. 2007 IEEE WORKSHOP ON SIGNAL PROCESSING APPLICATIONS FOR PUBLIC SECURITY AND FORENSICS, 2007, : 32 - +
  • [5] Improving of LVCSR for Causal Czech Using Publicly Available Language Resources
    Mizera, Petr
    Pollak, Petr
    [J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 427 - 437
  • [6] RUSSIAN LANGUAGE AND CULTURE OF SPEECH
    Kazmirchuk, O. Yu.
    [J]. NOVYI FILOLOGICHESKII VESTNIK-NEW PHILOLOGICAL BULLETIN, 2009, (08):
  • [7] Speech Resources in the Tamasheq Language
    Boito, Marcely Zanon
    Bougares, Fethi
    Barbier, Florentin
    Gahbiche, Souhir
    Barrault, Loic
    Rouvier, Mickael
    Esteve, Yannick
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 2066 - 2071
  • [8] Improving Russian LVCSR Using Deep Neural Networks for Acoustic and Language Modeling
    Kipyatkova, Irina
    [J]. SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 291 - 300
  • [9] On the Development of Speech Resources for the Mixtec Language
    Caballero-Morales, Santiago-Omar
    [J]. SCIENTIFIC WORLD JOURNAL, 2013,
  • [10] LVCSR with Transformer Language Models
    Beck, Eugen
    Schlueter, Ralf
    Ney, Hermann
    [J]. INTERSPEECH 2020, 2020, : 1798 - 1802