Satja: Thai Elderly Speech Corpus for Speech Recognition

被引:0
|
作者
Prajongjai, Suphunnee [1 ]
Triyason, Tuul [1 ]
Mongkolnam, Pornchai [1 ]
机构
[1] King Mongkuts Univ Technol Thonburi, Sch Informat Technol, Bangkok, Thailand
关键词
Speech corpus development; Speech recognition system; Thai language; Elderly;
D O I
10.1145/3291280.3291793
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Thai language is the official language of Thailand. At present, about 70 million speakers are located in Thailand and the southern parts of China, Yunnan, Guizhou, and Guangxi. The Thai language is a tonal language. Thai Language is a challenging language for speech processing technology. Because the Thai spoken language database is limited and also lacks a specific speech corpus, such as a children's speech database, elderly speech, accents spoken in each region, etc. This research develops the Thai elderly speech named Satja meaning is truth of speech. The content of this corpus is a voice command There are 50 speakers, 24 males and 26 females, covering six regions in Thailand, aged 60-85 years. In addition, the database of elderly voice was compared to non-elderly voice. For a model training, we used CMUSphinx and tested with Sphinx4. We found that when the elderly speech was tested with the elderly model, it was more accurate when experimented than the model trained by the non-elderly people.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] DEVELOPMENT OF NEW SPEECH CORPUS FOR ELDERLY JAPANESE SPEECH RECOGNITION
    Iribe, Yurie
    Kitaoka, Norihide
    Segawa, Shuhei
    [J]. 2015 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2015 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2015, : 27 - 31
  • [2] Improving Speech Recognition for the Elderly: A New Corpus of Elderly Japanese Speech and Investigation of Acoustic Modeling for Speech Recognition
    Fukuda, Meiko
    Nishizaki, Hiromitsu
    Iribe, Yurie
    Nishimura, Ryota
    Kitaoka, Norihide
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6578 - 6585
  • [3] Construction of a Corpus for Elderly Japanese Speech Recognition
    Fukuda, Meiko
    Nishimura, Ryota
    Kitaoka, Norihide
    Nishizaki, Hiromitsu
    Iribe, Yurie
    [J]. 2018 IEEE 7TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE 2018), 2018, : 687 - 688
  • [4] PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition
    Taerungruang, Supawat
    Taninpong, Phimphaka
    Chunwijitra, Vataya
    Thatphithakkul, Sumonmas
    Kasuriya, Sawit
    Inthanon, Viroj
    Paksaranuwat, Pawat
    Thumronglaohapun, Salinee
    Nakharutai, Nawapon
    Inkeaw, Papangkorn
    Bootkrajang, Jakramate
    [J]. COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [5] THE CU-MFEC CORPUS FOR THAI AND ENGLISH SPELLING SPEECH RECOGNITION
    Kertkeidkachorn, Natthawut
    Chanjaradwichai, Supadaech
    Suri, Teera
    Likitsupin, Krerksak
    Vorapatratorn, Surapol
    Hirankan, Pawanrat
    Limpanadusadee, Worasa
    Chuetanapinyo, Supakit
    Pitakpawatkul, Kitanan
    Puangsri, Natnarong
    Tangsirirat, Nathacha
    Trakulsuk, Konlawachara
    Punyabukkana, Proadpran
    Suchato, Atiwong
    [J]. 2012 INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2012, : 18 - 23
  • [6] Speed compensation for improving Thai spelling recognition with a continuous speech corpus
    Pisarn, C
    Theeramunkong, T
    [J]. INTELLIGENCE IN COMMUNICATION SYSTEMS, 2004, 3283 : 100 - 111
  • [7] DEVELOPING A THAI EMOTIONAL SPEECH CORPUS
    Kasuriya, Sawit
    Teeramunkong, Thanaruk
    Wutiwiwatchai, Chai
    [J]. 2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [8] Thai automatic speech recognition
    Suebvisai, S
    Charoenpomsawat, P
    Black, A
    Woszczyna, M
    Schultz, T
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 857 - 860
  • [9] Spontaneous Thai Speech Recognition
    Woszczyna, Monika
    Charoenpornsawat, Paisarn
    Schultz, Tanja
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1882 - 1885
  • [10] An open and free Speech Corpus for Speaker Recognition: The FSCSR Speech Corpus
    Bouziane, Ayoub
    Kadi, Houda
    Hourri, Soufiane
    Kharroubi, Jamal
    [J]. 2016 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2016,