Lexical modeling for the development of Amharic automatic speech recognition systems

Cited by: 0
Authors
Tachbelie, Martha Yifiru [1]
Abate, Solomon Teferra [1]
Affiliations
[1] Addis Ababa Univ, Sch Informat Sci, Addis Ababa, Ethiopia
Keywords
Amharic; Lexical model; Under-resourced language; Automatic speech recognition; LANGUAGE; ASR;
DOI
10.1007/s10579-023-09659-y
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Amharic is the second most widely spoken Semitic language after Arabic. It has its own syllabic writing system, in which each character represents a consonant and a vowel. Automatic speech recognition (ASR) research for Amharic has so far relied on grapheme-based pronunciation lexicons, taking advantage of the nature of this writing system. However, the epenthetic vowel and the glottal stop consonant represented in the writing system may not be pronounced in all of their occurrences. Moreover, the writing system does not distinguish geminated from non-geminated consonants. The grapheme-based pronunciation lexicon used so far therefore has limitations with regard to these language features. To address them, we prepared word- and morpheme-based pronunciation lexicons using data-driven transcription and knowledge-driven expert transcription. The data-driven transcription was used to prepare the training pronunciation lexicon, while the knowledge-driven transcription was used to prepare the morpheme- and word-based pronunciation lexicons for decoding. With the morpheme-based knowledge-driven lexicons, ASR performance improved over the baseline system that used the grapheme-based lexicon, even though the number of phones is considerably larger (60) than in the grapheme-based lexicon (37).
Pages: 963-984
Page count: 22