Boosting Thai Syllable Speech Recognition Using Acoustic Models Combination

Cited by: 4

Authors:

Tangwongsan, Supachai [1]

Phoophuangpairoj, Rong [1]

Affiliations:

[1] Mahidol Univ, Dept Comp Sci, Bangkok 10400, Thailand

Source:

ICCEE 2008: PROCEEDINGS OF THE 2008 INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING | 2008

DOI:

10.1109/ICCEE.2008.130

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper proposes a highly effective system for Thai speech recognition. The speaker-independent recognizer is built with Continuous Density Hidden Markov Models (CDHMMs). At the acoustic level, models trained on both speaker genders together and models trained on each gender separately are investigated and compared in terms of accuracy. Experimental evaluation shows that combining the acoustic models improves accuracy considerably at the acoustic level. The combination supports spoken utterances from both genders while still providing high accuracy. With the combined acoustic models, a syllable accuracy of 89.84% is achieved, a 4.53% improvement over conventional acoustic models trained on both genders.
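
The abstract does not say how the gender-specific models are combined. One common way to combine such models, shown here only as a minimal sketch and not as the authors' method, is to decode the utterance with each model set and keep the hypothesis with the highest log-likelihood. The names `Scorer` and `recognize_combined`, the toy scorers, and the syllable labels are all hypothetical and exist only for illustration.

```python
from typing import Callable, Dict, Sequence, Tuple

# Hypothetical scorer type (an assumption, not from the paper): maps a
# sequence of feature vectors to (best syllable, log-likelihood).
Scorer = Callable[[Sequence[Sequence[float]]], Tuple[str, float]]


def recognize_combined(
    features: Sequence[Sequence[float]],
    scorers: Dict[str, Scorer],
) -> Tuple[str, str, float]:
    """Decode with every model set and keep the highest-scoring hypothesis.

    Returns (model name, syllable, log-likelihood).
    """
    if not scorers:
        raise ValueError("no acoustic models supplied")
    best = None
    for name, scorer in scorers.items():
        syllable, loglik = scorer(features)
        # Keep the hypothesis with the largest log-likelihood seen so far.
        if best is None or loglik > best[2]:
            best = (name, syllable, loglik)
    return best


# Toy stand-ins for decoders backed by gender-specific model sets.
toy_scorers = {
    "male": lambda feats: ("kaa", -120.0),
    "female": lambda feats: ("khaa", -95.5),
}

result = recognize_combined([[0.0, 0.0]], toy_scorers)
print(result)
```

In this sketch the speaker's gender never has to be known in advance, because the better-matching model set wins on likelihood. That matches the abstract's claim that the combination supports utterances from both genders.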

Pages: 568-572

Number of pages: 5
