Lexical modeling for the development of Amharic automatic speech recognition systems

Cited by: 0
Authors
Tachbelie, Martha Yifiru [1]
Abate, Solomon Teferra [1]
Affiliations
[1] Addis Ababa Univ, Sch Informat Sci, Addis Ababa, Ethiopia
Keywords
Amharic; Lexical model; Under-resourced language; Automatic speech recognition; LANGUAGE; ASR;
DOI
10.1007/s10579-023-09659-y
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Amharic is the second most spoken Semitic language after Arabic. It has its own syllabic writing system, in which each character represents a consonant and a vowel. Automatic Speech Recognition (ASR) research for Amharic has so far been based on grapheme-based pronunciation lexicons, taking advantage of the nature of this writing system. However, the epenthetic vowel and the glottal stop consonant represented in the writing system may not be pronounced in all of their occurrences. Moreover, the writing system does not distinguish geminated from non-geminated forms of consonants. The grapheme-based pronunciation lexicon used so far is therefore limited with regard to these language features. To address these limitations, we have prepared word- and morpheme-based pronunciation lexicons using data-driven and knowledge-driven expert transcriptions. The data-driven transcription has been used to prepare the training pronunciation lexicon, while the knowledge-driven transcription has been used to prepare the morpheme- and word-based pronunciation lexicons for decoding. When morpheme-based knowledge-driven lexicons are used, ASR performance improves over the baseline ASR system that used a grapheme-based lexicon, even though the number of phones (60) is much larger than the number used in the grapheme-based lexicon (37).
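The difference between a grapheme-based lexicon entry and its possible spoken forms can be illustrated with a minimal Python sketch. This is not the authors' method or phone set: the ASCII phone symbols, the function names, and the small consonant table are hypothetical, chosen only for illustration. The sketch relies on the layout of the Unicode Ethiopic block, where each consonant row starts at a multiple of 8 and the first seven slots hold the seven vowel orders; the sixth order carries the epenthetic vowel that may go unpronounced. Gemination is not modeled here.

```python
from itertools import product

# Hypothetical ASCII phone symbols for the seven vowel orders
# (illustrative only; not the phone set used in the paper).
VOWELS = ["e", "u", "i", "a", "ie", "ix", "o"]
EPENTHETIC = "ix"  # sixth-order vowel, which may not be pronounced

# First-order code points of a few consonant rows in the Unicode
# Ethiopic block (a small illustrative subset, not a full table).
CONSONANT_ROWS = {
    0x1208: "l", 0x1218: "m", 0x1230: "s",
    0x1260: "b", 0x1270: "t", 0x1290: "n",
}

def split_syllable(ch):
    """Split one syllabary character into (consonant, vowel)."""
    cp = ord(ch)
    base, order = cp - (cp % 8), cp % 8
    if base not in CONSONANT_ROWS or order > 6:
        raise ValueError(f"character not covered by this sketch: {ch!r}")
    return CONSONANT_ROWS[base], VOWELS[order]

def grapheme_pronunciation(word):
    """Grapheme-based entry: every character yields consonant + vowel."""
    phones = []
    for ch in word:
        phones.extend(split_syllable(ch))
    return phones

def pronunciation_variants(word):
    """All variants in which each epenthetic vowel is kept or dropped."""
    options = []
    for ch in word:
        consonant, vowel = split_syllable(ch)
        if vowel == EPENTHETIC:
            options.append([[consonant, vowel], [consonant]])
        else:
            options.append([[consonant, vowel]])
    return [[p for syl in combo for p in syl] for combo in product(*options)]

# Two sixth-order characters: l-row (U+120D) followed by b-row (U+1265).
word = chr(0x120D) + chr(0x1265)
print(grapheme_pronunciation(word))
for variant in pronunciation_variants(word):
    print(" ".join(variant))
```

Enumerating every keep-or-drop combination over-generates, so in practice the candidate set would need to be narrowed, for example by expert transcription or by evidence from speech data, which is the kind of selection the abstract describes.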
Pages: 963-984
Number of pages: 22
Related papers
50 in total
  • [1] Lexical modeling for the development of Amharic automatic speech recognition systems
    Martha Yifiru Tachbelie
    Solomon Teferra Abate
    [J]. Language Resources and Evaluation, 2023, 57 : 963 - 984
  • [2] Using morphemes in language modeling and automatic speech recognition of Amharic
    Tachbelie, Martha Yifiru
    Abate, Solomon Teferra
    Menzel, Wolfgang
    [J]. NATURAL LANGUAGE ENGINEERING, 2014, 20 (02) : 235 - 259
  • [3] Lexical and Phonetic Modeling for Arabic Automatic Speech Recognition
    Nguyen, Long
    Ng, Tim
    Nguyen, Kham
    Zbib, Rabih
    Makhoul, John
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 708 - +
  • [4] Lexical modeling of non-native speech for automatic speech recognition
    Livescu, K
    Glass, J
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1683 - 1686
  • [5] Effect of Language Resources on Automatic Speech Recognition for Amharic
    Tachbelie, Martha Yifiru
    Abate, Solomon Teferra
    [J]. PROCEEDINGS OF THE 2015 12TH IEEE AFRICON INTERNATIONAL CONFERENCE - GREEN INNOVATION FOR AFRICAN RENAISSANCE (AFRICON), 2015,
  • [6] SPEECH DISFLUENCIES MODELING IN AUTOMATIC SPEECH RECOGNITION SYSTEMS
    Vasilisa, Verkhodanova O.
    Alexey, Karpov A.
    [J]. TOMSK STATE UNIVERSITY JOURNAL, 2012, (363): : 10 - +
  • [7] Automatic Speech Recognition for an Under-Resourced Language - Amharic
    Abate, Solomon Teferra
    Menzel, Wolfgang
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1737 - 1740
  • [8] Automatic Speech Recognition for an Under-Resourced Language - Amharic
    Abate, Solomon Teferra
    Menzel, Wolfgang
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 973 - 976
  • [9] INTEGRATED PRONUNCIATION LEARNING FOR AUTOMATIC SPEECH RECOGNITION USING PROBABILISTIC LEXICAL MODELING
    Rasipuram, Ramya
    Razavi, Marzieh
    Magimai-Doss, Mathew
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5176 - 5180
  • [10] Automatic speech recognition using probabilistic transcriptions in Swahili, Amharic, and Dinka
    Das, Amit
    Jyothi, Preethi
    Hasegawa-Johnson, Mark
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3524 - 3528