A NEW APPROACH TO DEVELOP A SYLLABLE BASED, CONTINUOUS AMHARIC SPEECH RECOGNIZER

Authors
Gebremedhin, Yitagessu B. [1]
Duckhorn, Frank [1]
Hoffmann, Ruediger [1]
Kraljevski, Ivan [1]
Affiliations
[1] Tech Univ Dresden, Chair Syst Theory & Speech Technol, Dresden, Germany
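Keywords
Amharic; ASR; UASR; CV-syllable; Finite State Transducers
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract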
All previous syllable-based Automatic Speech Recognizers (ASRs) for the Amharic language were built by training a separate acoustic model for each of the 196 distinctly pronounced Consonant-Vowel (CV) syllables. In this paper, we demonstrate that a smaller number of acoustic models is sufficient to build a syllable-based, speaker-independent, continuous Amharic ASR. The recognizer is built for weather forecast and business report applications using the UASR (Unified Approach to Speech Synthesis and Recognition) toolkit. A new speech corpus of more than 35 hours is used for training. It is a collection of corpora recorded in three different environments in order to make the recognizer less sensitive to changes in recording environment and microphone. The grammar is based on finite state transducers, and the lexical model consists of thousands of words. Although acoustic models are trained for only 93 syllables, a recognition accuracy of 93.26% is achieved on a test set of 4,000 words collected from 10 speakers.
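The abstract does not state how the recognition accuracy was scored. As an illustration only, the sketch below assumes the conventional word accuracy definition, 100 * (N - S - D - I) / N, where N is the number of reference words and S, D, and I are the substitutions, deletions, and insertions found by a minimum edit distance alignment. The function names and the placeholder tokens are hypothetical and do not come from the paper or from the UASR toolkit.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token lists (S + D + I)."""
    prev = list(range(len(hyp) + 1))  # distance from empty reference
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_accuracy(ref, hyp):
    """Word accuracy in percent, assuming 100 * (N - S - D - I) / N."""
    if not ref:
        raise ValueError("reference transcript must not be empty")
    return 100.0 * (len(ref) - edit_distance(ref, hyp)) / len(ref)


# Placeholder tokens, not real test data: one substitution, one deletion.
reference = ["w1", "w2", "w3", "w4"]
hypothesis = ["w1", "wX", "w3"]
print(word_accuracy(reference, hypothesis))
```

Under this definition, insertions also count against the score, so accuracy can fall below zero when the hypothesis is much longer than the reference.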
Pages: 1678-1683
Number of pages: 6