THE CU-MFEC CORPUS FOR THAI AND ENGLISH SPELLING SPEECH RECOGNITION

被引:0
|
作者
Kertkeidkachorn, Natthawut [1 ]
Chanjaradwichai, Supadaech [1 ]
Suri, Teera [1 ]
Likitsupin, Krerksak [1 ]
Vorapatratorn, Surapol [1 ]
Hirankan, Pawanrat [1 ]
Limpanadusadee, Worasa [1 ]
Chuetanapinyo, Supakit [1 ]
Pitakpawatkul, Kitanan [1 ]
Puangsri, Natnarong [1 ]
Tangsirirat, Nathacha [1 ]
Trakulsuk, Konlawachara [1 ]
Punyabukkana, Proadpran [1 ]
Suchato, Atiwong [1 ]
机构
[1] Chulalongkorn Univ, Fac Engn, Dept Comp Engn, Spoken Language Syst Res Grp, Bangkok, Thailand
关键词
Speech corpus; Thai spelling corpus; Automatic speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Much of the efficiency of any Automatic Speech Recognition (ASR) system depends on its speech corpus. This is even more so for recognizers designed for specific tasks. Naturally, an ASR for spelling recognition performs better if it is trained with a spelling speech corpus rather than a generic one. Although several speech corpora are available in Thai, we are still lack of Thai spelling speech corpora. This paper reports collection of experiences gained from constructing CU-MFEC, a Thai spelling speech corpus designed for form filling or other applications of similar nature. CU-MFEC corpus employed 100 speakers and encompassed 58 hours and 10 minutes of speech. There are four sets of the corpus; Alphabets with short pauses, Continuous free spelling, Sentences, and Numbers and commands. We evaluated its efficiency by utilizing CU-MFEC with speech recognition tasks and found the accuracy rate of 79.37% for spelling task and 54.92% for connected spelling task.
引用
收藏
页码:18 / 23
页数:6
相关论文
共 50 条
  • [1] Speed compensation for improving Thai spelling recognition with a continuous speech corpus
    Pisarn, C
    Theeramunkong, T
    [J]. INTELLIGENCE IN COMMUNICATION SYSTEMS, 2004, 3283 : 100 - 111
  • [2] Thai spelling analysis for automatic spelling speech recognition
    Pisarn, Chutima
    Theeramunkong, Thanaruk
    [J]. INFORMATION SCIENCES, 2008, 178 (01) : 122 - 136
  • [3] Satja: Thai Elderly Speech Corpus for Speech Recognition
    Prajongjai, Suphunnee
    Triyason, Tuul
    Mongkolnam, Pornchai
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON ADVANCES IN INFORMATION TECHNOLOGY (IAIT2018), 2018,
  • [4] An HMM-based method for Thai spelling speech recognition
    Pisarn, C.
    Theeramunkong, T.
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2007, 54 (01) : 76 - 95
  • [5] Multimodal English corpus for automatic speech recognition
    Kunka, Bartosz
    Kupryjanow, Adam
    Dalka, Piotr
    Bratoszewski, Piotr
    Szczodrak, Maciej
    Spaleniak, Pawel
    Szykulski, Marcin
    Czyzewski, Andrzej
    [J]. 2013 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2013, : 106 - 111
  • [6] PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition
    Taerungruang, Supawat
    Taninpong, Phimphaka
    Chunwijitra, Vataya
    Thatphithakkul, Sumonmas
    Kasuriya, Sawit
    Inthanon, Viroj
    Paksaranuwat, Pawat
    Thumronglaohapun, Salinee
    Nakharutai, Nawapon
    Inkeaw, Papangkorn
    Bootkrajang, Jakramate
    [J]. COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [7] LibriVoxDeEn: A Corpus for German-to-English Speech Translation and German Speech Recognition
    Beilharz, Benjamin
    Sun, Xin
    Karimova, Sariya
    Riezler, Stefan
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3590 - 3594
  • [8] Constructing a Phonetic Transcribed Text Corpus for Southern Thai Dialect Speech Recognition
    Aunkaew, Sittichok
    Karnjanadecha, Montri
    Wutiwiwatchai, Chai
    [J]. PROCEEDINGS OF THE 2015 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2015, : 69 - 73
  • [9] DEVELOPING A THAI EMOTIONAL SPEECH CORPUS
    Kasuriya, Sawit
    Teeramunkong, Thanaruk
    Wutiwiwatchai, Chai
    [J]. 2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [10] LOTUS-BI: a Thai-English Code-mixing Speech Corpus
    Thatphithakkul, Sumonmas
    Chunwijitra, Vataya
    Sertsi, Phuttapong
    Chootrakool, Patcharika
    Kasuriya, Sawit
    [J]. 2019 22ND CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2019, : 40 - 44