Effect of Language Resources on Automatic Speech Recognition for Amharic

被引:0
|
作者
Tachbelie, Martha Yifiru [1 ]
Abate, Solomon Teferra [1 ]
机构
[1] Univ Addis Ababa, Coll Nat Sci, Sch Informat Sci, Addis Ababa, Ethiopia
关键词
automatic speech recognition; Amharic; language resources; ASR;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents our investigation of the effect of language resources on the performance of Amharic speech recognition. We have used language model training text of different sizes and seen the effect on word error rate (WER) reduction. Moreover, we have investigated the effect of handling language issues (germination, epenthetic vowel insertion and glottal stop consonant pronunciation) on the performance of speech recognition systems using data-driven phone-level transcriptions. The results of our experiments show that only slight reduction in WER can be obtained by increasing language model training text. However, proper transcription of gemination, the epenthetic vowel and the glottal stop consonant did not bring performance improvement for Amharic speech recognition. This can be attributed to the larger number of phone HMM acoustic models (62 compared to 37 phone set of the grapheme-based phone-level transcriptions) trained with a small (5 hrs) training speech.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Automatic Speech Recognition for an Under-Resourced Language - Amharic
    Abate, Solomon Teferra
    Menzel, Wolfgang
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1737 - 1740
  • [2] Using morphemes in language modeling and automatic speech recognition of Amharic
    Tachbelie, Martha Yifiru
    Abate, Solomon Teferra
    Menzel, Wolfgang
    [J]. NATURAL LANGUAGE ENGINEERING, 2014, 20 (02) : 235 - 259
  • [3] Automatic Speech Recognition for an Under-Resourced Language - Amharic
    Abate, Solomon Teferra
    Menzel, Wolfgang
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 973 - 976
  • [4] Lexical modeling for the development of Amharic automatic speech recognition systems
    Tachbelie, Martha Yifiru
    Abate, Solomon Teferra
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (03) : 963 - 984
  • [5] Lexical modeling for the development of Amharic automatic speech recognition systems
    Martha Yifiru Tachbelie
    Solomon Teferra Abate
    [J]. Language Resources and Evaluation, 2023, 57 : 963 - 984
  • [6] Automatic speech recognition using probabilistic transcriptions in Swahili, Amharic, and Dinka
    Das, Amit
    Jyothi, Preethi
    Hasegawa-Johnson, Mark
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3524 - 3528
  • [7] Morphosyntactic resources for automatic speech recognition
    Huet, Stephane
    Gravier, Guillaume
    Sebillot, Pascale
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 692 - 698
  • [8] PRELIMINARIES TO AUTOMATIC RECOGNITION OF SPEECH - LANGUAGE IDENTIFICATION
    HOUSE, AS
    NEUBERG, EP
    WOHLFORD, RE
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 57 : S34 - S34
  • [9] LANGUAGE MODEL VERBALIZATION FOR AUTOMATIC SPEECH RECOGNITION
    Sak, Hasim
    Beaufays, Francoise
    Nakajima, Kaisuke
    Allauzen, Cyril
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8262 - 8266
  • [10] GEOGRAPHIC LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION
    Xiao, Xiaoqiang
    Chen, Hong
    Zylak, Mark
    Sosa, Daniela
    Desu, Suma
    Krishnamoorthy, Mahesh
    Liu, Daben
    Paulik, Matthias
    Zhang, Yuchen
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6124 - 6128