Effect of Language Resources on Automatic Speech Recognition for Amharic

被引：0

作者：

Tachbelie, Martha Yifiru ^{[1
]}

Abate, Solomon Teferra ^{[1
]}

机构：

[1] Univ Addis Ababa, Coll Nat Sci, Sch Informat Sci, Addis Ababa, Ethiopia

来源：

PROCEEDINGS OF THE 2015 12TH IEEE AFRICON INTERNATIONAL CONFERENCE - GREEN INNOVATION FOR AFRICAN RENAISSANCE (AFRICON) | 2015年

关键词：

automatic speech recognition; Amharic; language resources; ASR;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper presents our investigation of the effect of language resources on the performance of Amharic speech recognition. We have used language model training text of different sizes and seen the effect on word error rate (WER) reduction. Moreover, we have investigated the effect of handling language issues (germination, epenthetic vowel insertion and glottal stop consonant pronunciation) on the performance of speech recognition systems using data-driven phone-level transcriptions. The results of our experiments show that only slight reduction in WER can be obtained by increasing language model training text. However, proper transcription of gemination, the epenthetic vowel and the glottal stop consonant did not bring performance improvement for Amharic speech recognition. This can be attributed to the larger number of phone HMM acoustic models (62 compared to 37 phone set of the grapheme-based phone-level transcriptions) trained with a small (5 hrs) training speech.

引用

页数：5

共 50 条

[1] Automatic Speech Recognition for an Under-Resourced Language - Amharic
Abate, Solomon Teferra
Menzel, Wolfgang
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1737 - 1740
[2] Using morphemes in language modeling and automatic speech recognition of Amharic
Tachbelie, Martha Yifiru
Abate, Solomon Teferra
Menzel, Wolfgang
[J]. NATURAL LANGUAGE ENGINEERING, 2014, 20 (02) : 235 - 259
[3] Automatic Speech Recognition for an Under-Resourced Language - Amharic
Abate, Solomon Teferra
Menzel, Wolfgang
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 973 - 976
[4] Lexical modeling for the development of Amharic automatic speech recognition systems
Tachbelie, Martha Yifiru
Abate, Solomon Teferra
[J]. LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (03) : 963 - 984
[5] Lexical modeling for the development of Amharic automatic speech recognition systems
Martha Yifiru Tachbelie
Solomon Teferra Abate
[J]. Language Resources and Evaluation, 2023, 57 : 963 - 984
[6] Automatic speech recognition using probabilistic transcriptions in Swahili, Amharic, and Dinka
Das, Amit
Jyothi, Preethi
Hasegawa-Johnson, Mark
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3524 - 3528
[7] Morphosyntactic resources for automatic speech recognition
Huet, Stephane
Gravier, Guillaume
Sebillot, Pascale
[J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 692 - 698
[8] PRELIMINARIES TO AUTOMATIC RECOGNITION OF SPEECH - LANGUAGE IDENTIFICATION
HOUSE, AS
NEUBERG, EP
WOHLFORD, RE
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 57 : S34 - S34
[9] LANGUAGE MODEL VERBALIZATION FOR AUTOMATIC SPEECH RECOGNITION
Sak, Hasim
Beaufays, Francoise
Nakajima, Kaisuke
Allauzen, Cyril
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8262 - 8266
[10] GEOGRAPHIC LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION
Xiao, Xiaoqiang
Chen, Hong
Zylak, Mark
Sosa, Daniela
Desu, Suma
Krishnamoorthy, Mahesh
Liu, Daben
Paulik, Matthias
Zhang, Yuchen
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6124 - 6128

← 1 2 3 4 5 →