A Comparison of Data-Driven Automatic Syllabification Methods

被引:0
|
作者
Adsett, Connie R. [1 ]
Marchand, Yannick [1 ]
机构
[1] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 1W5, Canada
关键词
Natural language processing; machine learning; automatic syllabification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although automatic syllabification is an important component in several natural language tasks, little has been done to compare the results of data-driven methods on a wide range of languages. This article compares the results of five data-driven syllabification algorithms (Hidden Markov Support Vector Machines, IB1, Liang's algorithm, the Look Up Procedure, and Syllabification by Analogy) on nine European languages in order to determine which algorithm performs best over all. Findings show that all algorithms achieve a mean word accuracy across all lexicons of over 90%. However, Syllabification by Analogy performs better than the other algorithms tested with a mean word accuracy of 96.84% (standard deviation of 2.93) whereas Liang's algorithm, the standard for hyphenation (used in TEX), produces the second best results with a mean of 95.67% (standard deviation of 5.70).
引用
收藏
页码:174 / 181
页数:8
相关论文
共 50 条
  • [41] Data-driven assessment of eQTL mapping methods
    Michaelson, Jacob J.
    Alberts, Rudi
    Schughart, Klaus
    Beyer, Andreas
    [J]. BMC GENOMICS, 2010, 11
  • [42] Data-driven assessment of eQTL mapping methods
    Jacob J Michaelson
    Rudi Alberts
    Klaus Schughart
    Andreas Beyer
    [J]. BMC Genomics, 11
  • [43] Data-driven methods for equity similarity prediction
    Yaros, John Robert
    Imielinski, Tomasz
    [J]. QUANTITATIVE FINANCE, 2015, 15 (10) : 1657 - 1681
  • [44] Data-driven Software Security: Models and Methods
    Erlingsson, Ulfar
    [J]. 2016 IEEE 29TH COMPUTER SECURITY FOUNDATIONS SYMPOSIUM (CSF 2016), 2016, : 9 - 15
  • [45] Investigation on Data-Driven Life Prediction Methods
    Yang, Shuai
    Liu, Chaoqin
    Zhou, Xue
    Liang, Wei
    Miao, Qiang
    [J]. 2012 INTERNATIONAL CONFERENCE ON QUALITY, RELIABILITY, RISK, MAINTENANCE, AND SAFETY ENGINEERING (ICQR2MSE), 2012, : 674 - 680
  • [46] Challenges of data-driven methods in product development
    Mehlstäubl, Jan
    Gadzo, Emir
    Atzberger, Alexander
    Paetzold, Kristin
    [J]. Konstruktion, 2022, 74 (06): : 60 - 66
  • [47] Data-driven Methods for Modeling Social Perception
    Todorov, Alexander
    Dotsch, Ron
    Wigboldus, Daniel H. J.
    Said, Chris P.
    [J]. SOCIAL AND PERSONALITY PSYCHOLOGY COMPASS, 2011, 5 (10): : 775 - 791
  • [48] Comparison of Data-Driven Site Characterization Methods through Benchmarking: Methodological and Application Aspects
    Shuku, Takayuki
    Phoon, Kok Kwang
    [J]. ASCE-ASME JOURNAL OF RISK AND UNCERTAINTY IN ENGINEERING SYSTEMS PART A-CIVIL ENGINEERING, 2023, 9 (02)
  • [49] Comparison of reconstruction accuracy and efficiency among autocalibrating data-driven parallel imaging methods
    Brau, Anja C. S.
    Beatty, Philip J.
    Skare, Stefan
    Bammer, Roland
    [J]. MAGNETIC RESONANCE IN MEDICINE, 2008, 59 (02) : 382 - 395
  • [50] Comparison of Data-Driven Link Estimation Methods in Low-Power Wireless Networks
    Zhang, Hongwei
    Sang, Lifeng
    Arora, Anish
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2010, 9 (11) : 1634 - 1648