Acoustic Features for Hidden Conditional Random Fields-Based Thai Tone Classification

被引:2
|
作者
Kertkeidkachorn, Natthawut [1 ]
Punyabukkana, Proadpran [1 ]
Suchato, Atiwong [1 ]
机构
[1] Chulalongkorn Univ, Dept Comp Engn, Fac Engn, Bangkok, Thailand
关键词
Design; Algorithms; Experimentation; Performance; Thai tone classification; hidden conditional random fields; acoustic features; tone features; energy; spectral information; RECOGNITION;
D O I
10.1145/2833088
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the Thai language, tone information is necessary for Thai speech recognition systems. Previous studies show that many acoustic cues are attributed to shapes of tones. Nevertheless, most Thai tone classification studies mainly adopted F-0 values and their derivatives without considering other acoustic features. In this article, other acoustic features for Thai tone classification are investigated. In the experiment, energy values and spectral information represented by three spectral-based features including the LPC-based feature, PLP-based feature, and MFCC-based feature are applied to the HCRF-based Thai tone classification, which was reported as the best approach for Thai tone classification. The energy values provide an error rate reduction of 22.40% in the isolated word scenario, while there are slight improvements in the continuous speech scenario. On the contrary, spectral-based features greatly contribute to Thai tone classification in the continuous-speech scenario, whereas spectral-based features slightly degrade performances in the isolated-word scenario. The best achievement in the continuous-speech scenario is obtained from the PLP-based feature, which yields an error rate reduction of 13.90%. Therefore, findings in this article are that energy values and spectral-based features, especially the PLP-based feature, are the main contributors to the improvement of the performances of Thai tone classification in the isolated-word scenario and the continuous-speech scenario, respectively.
引用
收藏
页数:26
相关论文
共 50 条
  • [31] Learning sparse conditional random fields to select features for land development classification
    Zhong, Ping
    Liu, Fang
    Wang, Runsheng
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2011, 32 (15) : 4203 - 4219
  • [32] Variational Hidden Conditional Random Fields with Beta Processes
    Luo, Chen
    Sun, Shiliang
    Zhao, Jing
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017,
  • [33] Re-Ranking Approach of Spoken Term Detection Using Conditional Random Fields-Based Triphone Detection
    Sawada, Naoki
    Nishizaki, Hiromitsu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (10): : 2518 - 2527
  • [34] Hidden Conditional Random Fields for Visual Speech Recognition
    Pass, Adrian
    Zhang, Jianguo
    Stewart, Darryl
    2009 13TH INTERNATIONAL MACHINE VISION AND IMAGE PROCESSING CONFERENCE, 2009, : 117 - 122
  • [35] Hyperparameter tuning for hidden unit conditional random fields
    Yang, Eun-Suk
    Kim, Jong Dae
    Park, Chan-Young
    Song, Hye-Jeong
    Kim, Yu-Seop
    ENGINEERING COMPUTATIONS, 2017, 34 (06) : 2054 - 2062
  • [36] Weakly Supervised Cervical Histopathological Image Classification Using Multilayer Hidden Conditional Random Fields
    Li, Chen
    Chen, Hao
    Xue, Dan
    Hu, Zhijie
    Zhang, Le
    He, Liangzi
    Xu, Ning
    Qi, Shouliang
    Ma, He
    Sun, Hongzan
    INFORMATION TECHNOLOGY IN BIOMEDICINE, 2019, 1011 : 209 - 221
  • [37] ATTITUDE CLASSIFICATION IN ADJACENCY PAIRS OF A HUMAN-AGENT INTERACTION WITH HIDDEN CONDITIONAL RANDOM FIELDS
    Barriere, Valentin
    Clavel, Chloe
    Essid, Slim
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4949 - 4953
  • [39] Modeling Broad Context for Tone Recognition with Conditional Random Fields
    Wang, Siwei
    Levow, Gina-Anne
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2300 - +
  • [40] Learning flexible features for conditional random fields
    Stewart, Liam
    He, Xuming
    Zemel, Richard S.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (08) : 1415 - 1426