Lexical modeling for the development of Amharic automatic speech recognition systems

Cited by: 0
Authors
Tachbelie, Martha Yifiru [1]
Abate, Solomon Teferra [1]
Affiliation
[1] Addis Ababa Univ, Sch Informat Sci, Addis Ababa, Ethiopia
Keywords
Amharic; Lexical model; Under-resourced language; Automatic speech recognition; LANGUAGE; ASR;
DOI
10.1007/s10579-023-09659-y
Chinese Library Classification
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Amharic is the second most spoken Semitic language after Arabic. It has its own syllabic writing system, in which each character represents a consonant and a vowel. Automatic speech recognition (ASR) research for Amharic has so far relied on grapheme-based pronunciation lexicons, taking advantage of the nature of this writing system. However, the epenthetic vowel and the glottal stop consonant represented in the writing system may not be pronounced in all of their occurrences. Moreover, the writing system does not differentiate geminated and non-geminated forms of consonants. The grapheme-based pronunciation lexicon used so far therefore has limitations with regard to these language features. To handle these limitations, we have prepared word- and morpheme-based pronunciation lexicons using data-driven transcription and knowledge-driven transcription by experts. The data-driven transcription was used to prepare the training pronunciation lexicon, while the knowledge-driven transcription was used to prepare the morpheme- and word-based pronunciation lexicons for decoding. With the morpheme-based knowledge-driven lexicons, ASR performance is better than that of the baseline system that used a grapheme-based lexicon, even though the number of phones (60) is much larger than in the grapheme-based lexicon (37).
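The grapheme-based approach described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' lexicon: the phone symbols, the three-consonant table, and the function name `grapheme_pronunciation` are all hypothetical, chosen only to show how each syllabic character expands mechanically into a consonant and a vowel. It relies on the layout of the Unicode Ethiopic block, where each consonant row spans eight consecutive code points ordered by vowel.

```python
# Hypothetical phone labels for the vowel orders of one consonant row.
# Index 5 is the sixth order, which carries the epenthetic vowel.
VOWEL_BY_ORDER = ["e", "u", "i", "a", "ie", "ix", "o", "ua"]

# Hypothetical, deliberately tiny consonant table: the first code point
# of each eight-character row mapped to a consonant label.
CONSONANT_BY_ROW = {
    0x1208: "l",  # row starting at U+1208
    0x1218: "m",  # row starting at U+1218
    0x1230: "s",  # row starting at U+1230
}


def grapheme_pronunciation(word: str) -> list[str]:
    """Expand every character into consonant + vowel, with no phonology applied."""
    phones = []
    for ch in word:
        code = ord(ch)
        order = (code - 0x1200) % 8      # position within the consonant row
        row_start = code - order         # first code point of that row
        phones.append(CONSONANT_BY_ROW[row_start])  # KeyError if not in the toy table
        phones.append(VOWEL_BY_ORDER[order])
    return phones


# U+1230 U+120B U+121D spells the word commonly romanized as "selam".
print(grapheme_pronunciation("\u1230\u120b\u121d"))
```

In this sketch the final sixth-order character always yields the epenthetic vowel label, whether or not it is pronounced, and a geminated consonant would get the same output as a non-geminated one. Those are the limitations the abstract attributes to grapheme-based lexicons.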
Pages: 963-984
Page count: 22