A NEW APPROACH TO DEVELOP A SYLLABLE BASED, CONTINUOUS AMHARIC SPEECH RECOGNIZER

被引：0

作者：

Gebremedhin, Yitagessu B. ^{[1
]}

Duckhorn, Frank ^{[1
]}

Hoffmann, Ruediger ^{[1
]}

Kraljevski, Ivan ^{[1
]}

机构：

[1] Tech Univ Dresden, Chair Syst Theory & Speech Technol, Dresden, Germany

来源：

2013 IEEE EUROCON | 2013年

关键词：

Amharic; ASR; UASR; CV-syllable; Finite State Transducers;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

All of the previous syllable based Automatic Speech Recognizers (ASRs) for the Amharic language are built by training a separate acoustic model for each of the 196 distinctly pronounced Consonant-Vowel (CV) syllable. In this paper, we will demonstrate that a smaller number of acoustic models are sufficient to build a syllable based, speaker independent, continuous, Amharic ASR. It is built for weather forecast and business report applications using the UASR (Unified Approach to Speech Synthesis and Recognition) Tool kit. A new speech corpus, which is of more than 35 hours duration, is used for training. It is a collection of corpora recorded in three different environments in order to make the recognizer less sensitive to recording environment and microphone changes. The grammar is finite state transducer based and the lexical model consists of thousands of words. Though acoustic models for only 93 syllables are trained, a recognition accuracy of 93.26% is achieved on a test set that has 4,000 words collected from 10 speakers.

引用

页码：1678 / 1683

页数：6

共 50 条

[1] A SYLLABLE BASED CONTINUOUS SPEECH RECOGNIZER for TAMIL
Lakshmi, A.
Murthy, Hema A.
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1878 - 1881
[2] Syllable Based Continuous Speech Recognizer With Varied Length Maximum Likelihood Character Segmentation
Ganesh, Akila A.
Ravichandran, Chandra
[J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 935 - 940
[3] SYLLABLE-BASED PROSODIC ANALYSIS OF AMHARIC READ SPEECH
Jokisch, Oliver
Birhanu, Yitagessu
Hoffmann, Ruediger
[J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 258 - 262
[4] Investigating The Use Of Syllable Acoustic Units For Amharic Speech Recognition
Dribssa, Adey Edessa
Tachbelie, Martha Yifiru
[J]. PROCEEDINGS OF THE 2015 12TH IEEE AFRICON INTERNATIONAL CONFERENCE - GREEN INNOVATION FOR AFRICAN RENAISSANCE (AFRICON), 2015,
[5] SYLLABLE DETECTION IN CONTINUOUS SPEECH
SARGENT, DC
LI, KP
FU, KS
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (02): : 410 - 410
[6] Spotting glottal stop in Amharic in continuous speech
Seid, Hussien
Yegnanarayana, B.
Rajendran, S.
[J]. COMPUTER SPEECH AND LANGUAGE, 2012, 26 (04): : 293 - 305
[7] ANEC: An Amharic Named Entity Corpus and Transformer Based Recognizer
Jibril, Ebrahim Chekol
Tantug, A. Cuneyd
[J]. IEEE ACCESS, 2023, 11 : 15799 - 15815
[8] AN AUTOMATIC CAPTION-SUPERIMPOSING SYSTEM WITH A NEW CONTINUOUS SPEECH RECOGNIZER
IMAI, T
ANDO, A
MIYASAKA, E
[J]. IEEE TRANSACTIONS ON BROADCASTING, 1994, 40 (03) : 184 - 189
[9] Syllable-based large vocabulary continuous speech recognition
Ganapathiraju, A
Hamaker, J
Picone, J
Ordowski, M
Doddington, GR
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (04): : 358 - 366
[10] PHRASE RECOGNIZER USING SYLLABLE-BASED ACOUSTIC MEASUREMENTS
JOHNSON, DH
WEINSTEIN, CJ
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1978, 26 (05): : 409 - 418

← 1 2 3 4 5 →