Using morphemes in language modeling and automatic speech recognition of Amharic

被引:1
|
作者
Tachbelie, Martha Yifiru [1 ]
Abate, Solomon Teferra [1 ]
Menzel, Wolfgang [2 ]
机构
[1] Univ Addis Ababa, Sch Informat Sci, Addis Ababa, Ethiopia
[2] Univ Hamburg, Dept Informat, Hamburg, Germany
关键词
D O I
10.1017/S1351324912000356
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents morpheme-based language models developed for Amharic (a morphologically rich Semitic language) and their application to a speech recognition task. A substantial reduction in the out of vocabulary rate has been observed as a result of using subwords or morphemes. Thus a severe problem of morphologically rich languages has been addressed. Moreover, lower perplexity values have been obtained with morpheme-based language models than with word-based models. However, when comparing the quality based on the probability assigned to the test sets, word-based models seem to fare better. We have studied the utility of morpheme-based language models in speech recognition systems and found that the performance of a relatively small vocabulary (5k) speech recognition system improved significantly as a result of using morphemes as language modeling and dictionary units. However, as the size of the vocabulary increases (20k or more) the morpheme-based systems suffer from acoustic confusability and did not achieve a significant improvement over a word-based system with an equivalent vocabulary size even with the use of higher order (quadrogram) n-gram language models.
引用
收藏
页码:235 / 259
页数:25
相关论文
共 50 条
  • [41] Prosody modeling for automatic speech recognition and understanding
    Shriberg, E
    Stolcke, A
    [J]. MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 105 - 114
  • [42] FEDERATED ACOUSTIC MODELING FOR AUTOMATIC SPEECH RECOGNITION
    Cui, Xiaodong
    Lu, Songtao
    Kingsbury, Brian
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6748 - 6752
  • [43] Duration Modeling in Automatic Recited Speech Recognition
    Alotaibi, Yousef A.
    Yakoub, Mohammed Sidi
    Meftah, Ali
    Selouani, Sid-Ahmed
    [J]. 2016 39TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2016, : 323 - 326
  • [44] Sentence-Level Automatic Speech Segmentation for Amharic
    Tamiru, Rahel Mekonen
    Abate, Solomon Teferra
    [J]. PROCEEDINGS OF SIXTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2021), VOL 2, 2022, 236 : 477 - 485
  • [45] DYNAMIC ADJUSTMENT OF LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION USING WORD SIMILARITY
    Currey, Anna
    Illina, Irina
    Fohr, Dominique
    [J]. 2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 426 - 432
  • [46] Applications of automatic speech recognition to speech and language development in young children
    Russell, M
    Brown, C
    Skilling, A
    Series, R
    Wallace, J
    Bonham, B
    Barker, P
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 176 - 179
  • [48] Automatic Speech Recognition for Supporting Endangered Language Documentation
    Prud'hommeaux, Emily
    Jimerson, Robbie
    Hatcher, Richard
    Michelson, Karin
    [J]. LANGUAGE DOCUMENTATION & CONSERVATION, 2021, 15 : 491 - 513
  • [49] Skrybot - A System for Automatic Speech Recognition of Polish Language
    Pawlaczyk, Leslaw
    Bosky, Pawel
    [J]. MAN-MACHINE INTERACTIONS, 2009, 59 : 381 - +
  • [50] IMPROVING AUTOMATIC SPEECH RECOGNITION ROBUSTNESS FOR THE ROMANIAN LANGUAGE
    Buzo, Andi
    Cucu, Horia
    Burileanu, Corneliu
    [J]. 19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2119 - 2122