Evaluating Open-source Toolkits for Automatic Speech Recognition of South African Languages

Cited by: 0

Authors:

Naidoo, Ashentha [1]

Tsoeu, Mohohlo [1]

Affiliations:

[1] Univ Cape Town, Dept Elect Engn, Cape Town, South Africa

Source:

2019 SOUTHERN AFRICAN UNIVERSITIES POWER ENGINEERING CONFERENCE/ROBOTICS AND MECHATRONICS/PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA (SAUPEC/ROBMECH/PRASA) | 2019

Funding:

National Research Foundation, Singapore;

Keywords:

automatic speech recognition; under-resourced; evaluation; languages; isiXhosa; English;

DOI:

10.1109/robomech.2019.8704774

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Automatic speech recognition is a critical component of human language technologies. It concerns the translation of speech into textual data that can be processed by computers, and thus offers an intimate link that lets humans interact with machines on a completely natural level. A variety of open-source toolkits exist for developing these systems. These toolkits have been successfully implemented and tested on well-resourced languages, but the same level of testing has not been performed for South African languages. This investigation sets out to evaluate popular open-source toolkits for South African languages and to identify the optimal configuration for each language and toolkit. The NCHLT corpora were used to set up automatic speech recognition systems for English and isiXhosa using Kaldi, CMU Sphinx, and HTK. The best configurations from this investigation achieved better word error rates than those reported by the developers of the NCHLT corpus.


Pages: 160-165

Page count: 6
