THE CU-MFEC CORPUS FOR THAI AND ENGLISH SPELLING SPEECH RECOGNITION

被引：0

作者：

Kertkeidkachorn, Natthawut ^{[1
]}

Chanjaradwichai, Supadaech ^{[1
]}

Suri, Teera ^{[1
]}

Likitsupin, Krerksak ^{[1
]}

Vorapatratorn, Surapol ^{[1
]}

Hirankan, Pawanrat ^{[1
]}

Limpanadusadee, Worasa ^{[1
]}

Chuetanapinyo, Supakit ^{[1
]}

Pitakpawatkul, Kitanan ^{[1
]}

Puangsri, Natnarong ^{[1
]}

Tangsirirat, Nathacha ^{[1
]}

Trakulsuk, Konlawachara ^{[1
]}

Punyabukkana, Proadpran ^{[1
]}

Suchato, Atiwong ^{[1
]}

机构：

[1] Chulalongkorn Univ, Fac Engn, Dept Comp Engn, Spoken Language Syst Res Grp, Bangkok, Thailand

来源：

2012 INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS | 2012年

关键词：

Speech corpus; Thai spelling corpus; Automatic speech recognition;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Much of the efficiency of any Automatic Speech Recognition (ASR) system depends on its speech corpus. This is even more so for recognizers designed for specific tasks. Naturally, an ASR for spelling recognition performs better if it is trained with a spelling speech corpus rather than a generic one. Although several speech corpora are available in Thai, we are still lack of Thai spelling speech corpora. This paper reports collection of experiences gained from constructing CU-MFEC, a Thai spelling speech corpus designed for form filling or other applications of similar nature. CU-MFEC corpus employed 100 speakers and encompassed 58 hours and 10 minutes of speech. There are four sets of the corpus; Alphabets with short pauses, Continuous free spelling, Sentences, and Numbers and commands. We evaluated its efficiency by utilizing CU-MFEC with speech recognition tasks and found the accuracy rate of 79.37% for spelling task and 54.92% for connected spelling task.

引用

页码：18 / 23

页数：6

共 50 条

[1] Speed compensation for improving Thai spelling recognition with a continuous speech corpus
Pisarn, C
Theeramunkong, T
[J]. INTELLIGENCE IN COMMUNICATION SYSTEMS, 2004, 3283 : 100 - 111
[2] Thai spelling analysis for automatic spelling speech recognition
Pisarn, Chutima
Theeramunkong, Thanaruk
[J]. INFORMATION SCIENCES, 2008, 178 (01) : 122 - 136
[3] Satja: Thai Elderly Speech Corpus for Speech Recognition
Prajongjai, Suphunnee
Triyason, Tuul
Mongkolnam, Pornchai
[J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON ADVANCES IN INFORMATION TECHNOLOGY (IAIT2018), 2018,
[4] An HMM-based method for Thai spelling speech recognition
Pisarn, C.
Theeramunkong, T.
[J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2007, 54 (01) : 76 - 95
[5] Multimodal English corpus for automatic speech recognition
Kunka, Bartosz
Kupryjanow, Adam
Dalka, Piotr
Bratoszewski, Piotr
Szczodrak, Maciej
Spaleniak, Pawel
Szykulski, Marcin
Czyzewski, Andrzej
[J]. 2013 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2013, : 106 - 111
[6] PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition
Taerungruang, Supawat
Taninpong, Phimphaka
Chunwijitra, Vataya
Thatphithakkul, Sumonmas
Kasuriya, Sawit
Inthanon, Viroj
Paksaranuwat, Pawat
Thumronglaohapun, Salinee
Nakharutai, Nawapon
Inkeaw, Papangkorn
Bootkrajang, Jakramate
[J]. COMPUTER SPEECH AND LANGUAGE, 2025, 89
[7] LibriVoxDeEn: A Corpus for German-to-English Speech Translation and German Speech Recognition
Beilharz, Benjamin
Sun, Xin
Karimova, Sariya
Riezler, Stefan
[J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3590 - 3594
[8] Constructing a Phonetic Transcribed Text Corpus for Southern Thai Dialect Speech Recognition
Aunkaew, Sittichok
Karnjanadecha, Montri
Wutiwiwatchai, Chai
[J]. PROCEEDINGS OF THE 2015 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2015, : 69 - 73
[9] DEVELOPING A THAI EMOTIONAL SPEECH CORPUS
Kasuriya, Sawit
Teeramunkong, Thanaruk
Wutiwiwatchai, Chai
[J]. 2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
[10] LOTUS-BI: a Thai-English Code-mixing Speech Corpus
Thatphithakkul, Sumonmas
Chunwijitra, Vataya
Sertsi, Phuttapong
Chootrakool, Patcharika
Kasuriya, Sawit
[J]. 2019 22ND CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2019, : 40 - 44

← 1 2 3 4 5 →