Efficient two-stage processing for joint sequence model-based Thai grapheme-to-phoneme conversion

被引:8
|
作者
Rugchatjaroen, Anocha [1 ]
Saychum, Sittipong [1 ]
Kongyoung, Sarawoot [1 ]
Chootrakool, Patcharika [1 ]
Kasuriya, Sawit [1 ]
Wutiwiwatchai, Chai [1 ]
机构
[1] Natl Sci & Technol Dev Agcy, Natl Elect & Comp Technol Ctr, Khlong Luang, Thailand
关键词
Grapheme to phoneme conversion; Thai; Joint sequence modelling; Thai pseudo-syllable;
D O I
10.1016/j.specom.2018.12.003
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Thai grapheme-to-phoneme conversion (G2P) is a challenging task due to its difficulties found by many previous studies. This paper introduces a novel two-stage processing for Thai G2P. The first stage uses Conditional Random Fields (CRF) to segment input text into pseudo-syllable (PS) units, the smallest unit with pronunciation inconfusable. This first CRF simultaneously segments input text into PS units and predicts the function of each character. Outputs from the first stage are used to efficiently align graphemes and phonemes, forming graphone joint sequences as input for the next stage. The second stage uses another CRF to model the graphone joint sequences. The character function predicted by the first stage is the cue to explicitly solve some critical Thai G2P difficulties such as hidden syllables often appeared in loan words and complicated character ordering. An evaluation is done using a large pronunciation dictionary that covers over 70% of Thai word usage. Experimental results show that 6.55% and 8.43% word error rates (WER) are obtained at the first and the second prediction states, while the overall G2P achieves a 9.94% WER. This is as much as 14.49% absolute improvement from a baseline model using Context Free Grammar (CFG) syllabification and syllable n-gram modeling.
引用
收藏
页码:105 / 111
页数:7
相关论文
共 50 条
  • [31] Two-Stage Learning Model-Based Angle Diversity Method for Underwater Acoustic Array
    Zhang, Yu
    Zhang, Dan
    Han, Zhen
    Jiang, Peng
    [J]. MARINE GEODESY, 2023, 46 (03) : 216 - 250
  • [32] Model-Based Inference for Two-Stage Cluster Samples Subject to Nonignorable Item Nonresponse
    Yuan, Ying
    Little, Roderick J. A.
    [J]. JOURNAL OF OFFICIAL STATISTICS, 2008, 24 (02) : 193 - 211
  • [33] A Joint Dispatch Operation Method of Hydropower and Photovoltaic: Based on the Two-Stage Hedging Model
    Xie, Tuo
    Liu, Hong
    Zhang, Gang
    Zhang, Kaoshe
    Li, Pai
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [34] Two-stage visual localisation: Landmark-based pose initialisation and model-based pose refinement
    Chen, ZZ
    Pe, P
    McDermid, J
    Pears, N
    [J]. 2005 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2005, : 3763 - 3769
  • [35] Efficient Image Details Preservation of Image Processing Pipeline Based on Two-Stage Tone Mapping
    Xu, Weijian
    Cai, Yuyang
    Qian, Feng
    Hu, Yuan
    Yan, Jingwen
    [J]. MATHEMATICS, 2024, 12 (10)
  • [36] Quality-Enhancing Techniques for a Two-Stage Model-Based Approach for Magnetic Particle Imaging
    Gapyak, Vladyslav
    Maerz, Thomas
    Weinmann, Andreas
    [J]. INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2022, ICNAAM-2022, 2024, 3094
  • [37] Two-Stage Polsar Scattering Model-Based Classification Scheme for Improved Glacier Facies Mapping
    Panwar, Ruby
    Kumar, Amit
    Kumar, Praveen
    [J]. JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2024,
  • [38] Compound Interference Suppression for Bistatic FDA-MIMO Radar Based on Joint Two-Stage Processing
    Sun, Wenhao
    Lan, Lan
    Liao, Guisheng
    Qi, Jiawei
    [J]. 2022 IEEE 12TH SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP (SAM), 2022, : 375 - 379
  • [39] RBF-ARX model-based two-stage scheduling RPC for dynamic systems with bounded disturbance
    Feng Zhou
    Hui Peng
    Xiaoyong Zeng
    Xiaoying Tian
    [J]. Neural Computing and Applications, 2019, 31 : 4185 - 4200
  • [40] RBF-ARX model-based two-stage scheduling RPC for dynamic systems with bounded disturbance
    Zhou, Feng
    Peng, Hui
    Zeng, Xiaoyong
    Tian, Xiaoying
    [J]. NEURAL COMPUTING & APPLICATIONS, 2019, 31 (08): : 4185 - 4200