Boosting Thai Syllable Speech Recognition Using Acoustic Models Combination

被引:4
|
作者
Tangwongsan, Supachai [1 ]
Phoophuangpairoj, Rong [1 ]
机构
[1] Mahidol Univ, Dept Comp Sci, Bangkok 10400, Thailand
关键词
D O I
10.1109/ICCEE.2008.130
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a highly effective system for Thai speech recognition is proposed. The speech recognizer for so-called speaker-independent is created by using Continuous Density Hidden Markov Model (CDHMM). In the acoustic level, the models trained for both speaker genders, and for each separate gender are investigated and tested in terms Of accuracy. Experimental evaluation shows that with the acoustic models combination, the accuracy could be improved considerably in the acoustic level. The acoustic combination can support spoken utterances from both genders and still provide the high accuracy simultaneously. Interestingly, when using the acoustic models combination, the syllable accuracy of 89.84% is achieved with 4.53% improvement over using the conventional acoustic models trained for both genders.
引用
收藏
页码:568 / 572
页数:5
相关论文
共 50 条
  • [1] Boosting acoustic models in large vocabulary speech recognition
    Meyer, C
    Schramm, H
    [J]. PROCEEDINGS OF THE SIXTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2004, : 255 - 260
  • [2] Context-independent acoustic models for Thai speech recognition
    Kasuriya, S
    Kanokphara, S
    Thatphithakkul, N
    Cotsomrong, P
    Sunpethniyom, T
    [J]. IEEE INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2004 (ISCIT 2004), PROCEEDINGS, VOLS 1 AND 2: SMART INFO-MEDIA SYSTEMS, 2004, : 991 - 994
  • [3] Boosting HMM acoustic models in large vocabulary speech recognition
    Meyer, C
    Schramm, H
    [J]. SPEECH COMMUNICATION, 2006, 48 (05) : 532 - 548
  • [4] Trajectory clustering of syllable-length acoustic models for continuous speech recognition
    Han, Yan
    Hamalainen, Annika
    Boves, Lou
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1169 - 1172
  • [5] Dual stream speech recognition using articulatory syllable models
    Puurula, Antti
    Van Compernolle, Dirk
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2010, 13 (04) : 219 - 230
  • [6] Highly efficient and effective techniques for Thai syllable speech recognition
    Tangwongsan, S
    Po-Aramsri, P
    Phoophuangpairoj, R
    [J]. ADVANCES IN COMPUTER SCIENCE - ASIAN 2004, PROCEEDINGS, 2004, 3321 : 259 - 270
  • [7] Multimodal Data Fusion of Electromyography and Acoustic Signals for Thai Syllable Recognition
    Jong, Nida Sae
    de Herrera, Alba Garcia Seco
    Phukpattaranont, Pornchai
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (06) : 1997 - 2006
  • [8] Investigating The Use Of Syllable Acoustic Units For Amharic Speech Recognition
    Dribssa, Adey Edessa
    Tachbelie, Martha Yifiru
    [J]. PROCEEDINGS OF THE 2015 12TH IEEE AFRICON INTERNATIONAL CONFERENCE - GREEN INNOVATION FOR AFRICAN RENAISSANCE (AFRICON), 2015,
  • [9] Context Dependent Syllable Acoustic Model for Continuous Chinese Speech Recognition
    Wu, Hao
    Wu, Xihong
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1961 - 1964
  • [10] TASK ADAPTATION IN SYLLABLE TRIGRAM MODELS FOR CONTINUOUS SPEECH RECOGNITION
    MATSUNAGA, S
    YAMADA, T
    SHIKANO, K
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1993, E76D (01) : 38 - 43