Evaluating Open-source Toolkits for Automatic Speech Recognition of South African Languages

被引:0
|
作者
Naidoo, Ashentha [1 ]
Tsoeu, Mohohlo [1 ]
机构
[1] Univ Cape Town, Dept Elect Engn, Cape Town, South Africa
基金
新加坡国家研究基金会;
关键词
automatic speech recognition; under-resourced; evaluation; languages; isiXhosa; English;
D O I
10.1109/robomech.2019.8704774
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Automatic speech recognition is a critical component of human language technologies. It concerns the translation of speech into textual data which can be processed by computers. Thus, it offers the creation of an intimate link allowing humans to interact with machines on a completely natural level. A variety of open-source toolkits exist for the development of these systems. These toolkits have been successfully implemented and tested for use on well-resourced languages. However, the same level of testing has not been performed for South African languages. This investigation sets out to evaluate popular open-source tools for South African languages and identify optimal toolkit configurations for each language and toolkit The NCHLT corpora were used to set up automatic speech recognition systems for English and isiXhosa using Kaldi, CMU Sphinx, and HTK. The word error rates achieved during this investigation showed that the best configurations from this investigation achieved better performance than those which were reported by the developers of the NCHLT corpus.
引用
收藏
页码:160 / 165
页数:6
相关论文
共 50 条
  • [1] Collecting and evaluating speech recognition corpora for 11 South African languages
    Badenhorst, Jaco
    van Heerden, Charl
    Davel, Marelie
    Barnard, Etienne
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2011, 45 (03) : 289 - 309
  • [2] Collecting and evaluating speech recognition corpora for 11 South African languages
    Jaco Badenhorst
    Charl van Heerden
    Marelie Davel
    Etienne Barnard
    [J]. Language Resources and Evaluation, 2011, 45 : 289 - 309
  • [3] Code-switched automatic speech recognition in five South African languages
    Biswas, Astik
    Yilmaz, Emre
    van der Westhuizen, Ewald
    de Wet, Febe
    Niesler, Thomas
    [J]. COMPUTER SPEECH AND LANGUAGE, 2022, 71
  • [4] A Study on Automatic Speech Recognition Toolkits
    Ganesh, D. Satya
    Sahu, Prasant Kumar
    [J]. 2015 INTERNATIONAL CONFERENCE ON MICROWAVE, OPTICAL AND COMMUNICATION ENGINEERING (ICMOCE), 2015, : 365 - 368
  • [5] THE BAVIECA OPEN-SOURCE SPEECH RECOGNITION TOOLKIT
    Bolanos, Daniel
    [J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 354 - 359
  • [6] Using Open-Source Automatic Speech Recognition Tools for the Annotation of Dutch Infant-Directed Speech
    van der Klis, Anika
    Adriaans, Frans
    Han, Mengru
    Kager, Rene
    [J]. MULTIMODAL TECHNOLOGIES AND INTERACTION, 2023, 7 (07)
  • [7] Open-Source Text to Speech Synthesis System for Iberian Languages
    Alonso, Austin
    Sainzi, Inaki
    Erro, Daniel
    Navasi, Eva
    Hernaez, Inma
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2013, (51): : 169 - 175
  • [8] Automatic Speech Recognition for African Languages with Vowel Length Contrast
    Gauthier, Elodie
    Besacier, Laurent
    Voisin, Sylvie
    [J]. SLTU-2016 5TH WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGIES FOR UNDER-RESOURCED LANGUAGES, 2016, 81 : 136 - 143
  • [9] Speech Recognition System Using Open-Source Speech Engine for Indian Names
    Kallole, Nitin Arun
    Prakash, R.
    [J]. INTELLIGENT EMBEDDED SYSTEMS, ICNETS2, VOL II, 2018, 492 : 263 - 274
  • [10] A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
    Khassanov, Yerbolat
    Mussakhojayeva, Saida
    Mirzakhmetov, Almas
    Adiyev, Alen
    Nurpeiissov, Mukhamet
    Varol, Huseyin Atakan
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 697 - 706