Using morphemes in language modeling and automatic speech recognition of Amharic

被引:1
|
作者
Tachbelie, Martha Yifiru [1 ]
Abate, Solomon Teferra [1 ]
Menzel, Wolfgang [2 ]
机构
[1] Univ Addis Ababa, Sch Informat Sci, Addis Ababa, Ethiopia
[2] Univ Hamburg, Dept Informat, Hamburg, Germany
关键词
D O I
10.1017/S1351324912000356
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents morpheme-based language models developed for Amharic (a morphologically rich Semitic language) and their application to a speech recognition task. A substantial reduction in the out of vocabulary rate has been observed as a result of using subwords or morphemes. Thus a severe problem of morphologically rich languages has been addressed. Moreover, lower perplexity values have been obtained with morpheme-based language models than with word-based models. However, when comparing the quality based on the probability assigned to the test sets, word-based models seem to fare better. We have studied the utility of morpheme-based language models in speech recognition systems and found that the performance of a relatively small vocabulary (5k) speech recognition system improved significantly as a result of using morphemes as language modeling and dictionary units. However, as the size of the vocabulary increases (20k or more) the morpheme-based systems suffer from acoustic confusability and did not achieve a significant improvement over a word-based system with an equivalent vocabulary size even with the use of higher order (quadrogram) n-gram language models.
引用
收藏
页码:235 / 259
页数:25
相关论文
共 50 条
  • [31] LANGUAGE MODEL VERBALIZATION FOR AUTOMATIC SPEECH RECOGNITION
    Sak, Hasim
    Beaufays, Francoise
    Nakajima, Kaisuke
    Allauzen, Cyril
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8262 - 8266
  • [32] GEOGRAPHIC LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION
    Xiao, Xiaoqiang
    Chen, Hong
    Zylak, Mark
    Sosa, Daniela
    Desu, Suma
    Krishnamoorthy, Mahesh
    Liu, Daben
    Paulik, Matthias
    Zhang, Yuchen
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6124 - 6128
  • [33] Automatic emotional speech recognition in Serbian language
    Bojanic, Milana
    Delic, Vlado
    [J]. 2013 21ST TELECOMMUNICATIONS FORUM (TELFOR), 2013, : 459 - 465
  • [34] Language Modeling for Mixed Language Speech Recognition using Weighted Phrase Extraction
    Li, Ying
    Fung, Pascale
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2598 - 2602
  • [35] RELEVANCE LANGUAGE MODELING FOR SPEECH RECOGNITION
    Chen, Kuan-Yu
    Chen, Berlin
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5568 - 5571
  • [36] English-Filipino Speech Topic Tagger Using Automatic Speech Recognition Modeling and Topic Modeling
    Tumpalan, John Karl B.
    Recario, Reginald Neil C.
    [J]. ADVANCES IN INFORMATION AND COMMUNICATION, FICC, VOL 2, 2023, 652 : 427 - 445
  • [37] Large vocabulary continuous speech recognition for Estonian using morphemes and classes
    Alumäe, T
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 245 - 252
  • [38] Automatic Language Identification Using Speech Rhythm Features for Multi-Lingual Speech Recognition
    Kim, Hwamin
    Park, Jeong-Sik
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (07):
  • [39] Automatic Idiom Identification Model for Amharic Language
    Fenta, Anduamlak Abebe
    Gebeyehu, Seffi
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (08)
  • [40] Automatic Speech Recognition System Channel Modeling
    Tan, Qun Feng
    Audhkhasi, Kartik
    Georgiou, Panayiotis G.
    Ettelaie, Emil
    Narayanan, Shrikanth
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2442 - 2445