Lexical modeling for the development of Amharic automatic speech recognition systems

被引：0

作者：

Tachbelie, Martha Yifiru ^{[1
]}

Abate, Solomon Teferra ^{[1
]}

机构：

[1] Addis Ababa Univ, Sch Informat Sci, Addis Ababa, Ethiopia

来源：

LANGUAGE RESOURCES AND EVALUATION | 2023年 / 57卷 / 03期

关键词：

Amharic; Lexical model; Under-resourced language; Automatic speech recognition; LANGUAGE; ASR;

D O I：

10.1007/s10579-023-09659-y

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Amharic is the second most spoken Semitic language after Arabic. It has its own syllabary writing system, each character representing a consonant and a vowel. Automatic Speech Recognition (ASR) researches for Amharic have been conducted on the basis of grapheme-based pronunciation lexicon, taking advantage of the nature of its writing system. However, the epenthetic vowel and the glottal stop consonant represented in the writing system may not be pronounced in all of their occurrences. Moreover, the writing system does not differentiate geminated and non-geminated forms of consonants. Therefore, the grapheme-based pronunciation lexicon used so far has limitations with regard to these language features. To handle these limitations, we have prepared word- and morpheme-based pronunciation lexicons using data-driven and knowledge-driven experts' transcription. The data-driven transcription has been used for the preparation of training pronunciation lexicon while the knowledge-driven has been used to prepare morpheme- and word-based pronunciation lexicons for decoding. When morpheme-based knowledge-driven lexicons are used, better ASR performance (compared with the baseline ASR system that used grapheme-based lexicon) has been achieved although the number of phones is much more (60) than the number of phones used in the grapheme-based lexicon (37).

引用

页码：963 / 984

页数：22

共 50 条

[1] Lexical modeling for the development of Amharic automatic speech recognition systems
Martha Yifiru Tachbelie
Solomon Teferra Abate
[J]. Language Resources and Evaluation, 2023, 57 : 963 - 984
[2] Using morphemes in language modeling and automatic speech recognition of Amharic
Tachbelie, Martha Yifiru
Abate, Solomon Teferra
Menzel, Wolfgang
[J]. NATURAL LANGUAGE ENGINEERING, 2014, 20 (02) : 235 - 259
[3] Lexical and Phonetic Modeling for Arabic Automatic Speech Recognition
Nguyen, Long
Ng, Tim
Nguyen, Kham
Zbib, Rabih
Makhoul, John
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 708 - +
[4] Lexical modeling of non-native speech for automatic speech recognition
Livescu, K
Glass, J
[J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1683 - 1686
[5] Effect of Language Resources on Automatic Speech Recognition for Amharic
Tachbelie, Martha Yifiru
Abate, Solomon Teferra
[J]. PROCEEDINGS OF THE 2015 12TH IEEE AFRICON INTERNATIONAL CONFERENCE - GREEN INNOVATION FOR AFRICAN RENAISSANCE (AFRICON), 2015,
[6] SPEECH DISFLUENCIES MODELING IN AUTOMATIC SPEECH RECOGNITION SYSTEMS
Vasilisa, Verkhodanova O.
Alexey, Karpov A.
[J]. TOMSK STATE UNIVERSITY JOURNAL, 2012, (363): : 10 - +
[7] Automatic Speech Recognition for an Under-Resourced Language - Amharic
Abate, Solomon Teferra
Menzel, Wolfgang
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1737 - 1740
[8] Automatic Speech Recognition for an Under-Resourced Language - Amharic
Abate, Solomon Teferra
Menzel, Wolfgang
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 973 - 976
[9] INTEGRATED PRONUNCIATION LEARNING FOR AUTOMATIC SPEECH RECOGNITION USING PROBABILISTIC LEXICAL MODELING
Rasipuram, Ramya
Razavi, Marzieh
Magimai-Doss, Mathew
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5176 - 5180
[10] Automatic speech recognition using probabilistic transcriptions in Swahili, Amharic, and Dinka
Das, Amit
Jyothi, Preethi
Hasegawa-Johnson, Mark
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3524 - 3528

← 1 2 3 4 5 →