Lexical modeling for the development of Amharic automatic speech recognition systems

被引：0

作者：

Tachbelie, Martha Yifiru ^{[1
]}

Abate, Solomon Teferra ^{[1
]}

机构：

[1] Addis Ababa Univ, Sch Informat Sci, Addis Ababa, Ethiopia

来源：

LANGUAGE RESOURCES AND EVALUATION | 2023年 / 57卷 / 03期

关键词：

Amharic; Lexical model; Under-resourced language; Automatic speech recognition; LANGUAGE; ASR;

D O I：

10.1007/s10579-023-09659-y

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Amharic is the second most spoken Semitic language after Arabic. It has its own syllabary writing system, each character representing a consonant and a vowel. Automatic Speech Recognition (ASR) researches for Amharic have been conducted on the basis of grapheme-based pronunciation lexicon, taking advantage of the nature of its writing system. However, the epenthetic vowel and the glottal stop consonant represented in the writing system may not be pronounced in all of their occurrences. Moreover, the writing system does not differentiate geminated and non-geminated forms of consonants. Therefore, the grapheme-based pronunciation lexicon used so far has limitations with regard to these language features. To handle these limitations, we have prepared word- and morpheme-based pronunciation lexicons using data-driven and knowledge-driven experts' transcription. The data-driven transcription has been used for the preparation of training pronunciation lexicon while the knowledge-driven has been used to prepare morpheme- and word-based pronunciation lexicons for decoding. When morpheme-based knowledge-driven lexicons are used, better ASR performance (compared with the baseline ASR system that used grapheme-based lexicon) has been achieved although the number of phones is much more (60) than the number of phones used in the grapheme-based lexicon (37).

引用

页码：963 / 984

页数：22

共 50 条

[31] AUTOMATIC SPEECH RECOGNITION AND MEDICAL EXPERT SYSTEMS
NORWICH, KH
LANDAU, JA
[J]. CANADIAN MEDICAL AND BIOLOGICAL ENGINEERING SOCIETY CONFERENCE : PROCEEDINGS - 1989, 1989, : 57 - 58
[32] Automatic Speech Recognition System Development in the "Wild"
Ragni, Anton
Gales, Mark J. F.
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2217 - 2221
[33] Chhattisgarhi speech corpus for research and development in automatic speech recognition
Londhe, Narendra D.
Kshirsagar, Ghanahshyam B.
[J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (02) : 193 - 210
[34] A Comprehensive Examination of Phoneme Recognition in Automatic Speech Recognition Systems
Bhatt, Shobha
Bansal, Shweta
Kumar, Ankit
Pandey, Saroj Kumar
Ojha, Manoj Kumar
Singh, Kamred Udham
Chakraborty, Sanjay
Singh, Teekam
Swarup, Chetan
[J]. TRAITEMENT DU SIGNAL, 2023, 40 (05) : 1997 - 2008
[35] EXPERIMENTAL TECHNIQUE FOR ESTABLISHING LEXICAL VARIANTS BY RULE IN AUTOMATIC RECOGNITION OF CONTINUOUS SPEECH
TAPPERT, CC
DIXON, NR
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1973, 53 (01): : 355 - &
[36] DISCRIMINATIVE APPROACH TO LEXICAL ENTRY SELECTION FOR AUTOMATIC SPEECH RECOGNITION OF AGGLUTINATIVE LANGUAGE
Ablimit, Mijit
Kawahara, Tatsuya
Hamdulla, Askar
[J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5009 - 5012
[37] A Decade of Discriminative Language Modeling for Automatic Speech Recognition
Saraclar, Murat
Dikici, Erinc
Arisoy, Ebru
[J]. SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 11 - 22
[38] An Evaluation of Structured Language Modeling for Automatic Speech Recognition
Bjorklund, Johanna
Cleophas, Loek
Karlsson, My
[J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2017, 23 (11) : 1019 - 1034
[39] STATISTICAL MODELING AND AUTOMATIC PARAMETER ESTIMATION IN SPEECH RECOGNITION
BAHL, LR
BAKER, JK
JELINEK, F
MERCER, RL
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 59 : S96 - S96
[40] MODELING ERROR RECOVERY AND REPAIR IN AUTOMATIC SPEECH RECOGNITION
BABER, C
HONE, KS
[J]. INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1993, 39 (03): : 495 - 515

← 1 2 3 4 5 →