A Comparison of Data-Driven Automatic Syllabification Methods

被引:0
|
作者
Adsett, Connie R. [1 ]
Marchand, Yannick [1 ]
机构
[1] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 1W5, Canada
关键词
Natural language processing; machine learning; automatic syllabification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although automatic syllabification is an important component in several natural language tasks, little has been done to compare the results of data-driven methods on a wide range of languages. This article compares the results of five data-driven syllabification algorithms (Hidden Markov Support Vector Machines, IB1, Liang's algorithm, the Look Up Procedure, and Syllabification by Analogy) on nine European languages in order to determine which algorithm performs best over all. Findings show that all algorithms achieve a mean word accuracy across all lexicons of over 90%. However, Syllabification by Analogy performs better than the other algorithms tested with a mean word accuracy of 96.84% (standard deviation of 2.93) whereas Liang's algorithm, the standard for hyphenation (used in TEX), produces the second best results with a mean of 95.67% (standard deviation of 5.70).
引用
收藏
页码:174 / 181
页数:8
相关论文
共 50 条
  • [21] Comparison of Data-Driven Methods for Evaluating Earthquake-Induced Liquefaction Potential
    Hu, Jilei
    Liu, Huabei
    INFORMATION TECHNOLOGY IN GEO-ENGINEERING, 2020, : 353 - 364
  • [22] Comparison of Regression Methods on Data-Driven Controller Design for Systems with Observation Noise
    Ashida, Yoichiro
    PROCEEDINGS OF ISCIT 2021: 2021 20TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2021, : 55 - 58
  • [23] A comparison between model-based and data-driven leak localization methods
    Romero-Ben, Luis
    Blesa, Joaquim
    Cembrano, Gabriela
    Puig, Vicenc
    IFAC PAPERSONLINE, 2023, 56 (02): : 737 - 742
  • [24] Simulation Comparison among Three Data-Driven Control Methods for the Planar Manipulator
    Hou, Mengxue
    Jin, Shangtai
    2015 10TH ASIAN CONTROL CONFERENCE (ASCC), 2015,
  • [25] Comparison of data-driven thresholding methods using directed functional brain networks
    Manickam, Thilaga
    Ramasamy, Vijayalakshmi
    Doraisamy, Nandagopal
    REVIEWS IN THE NEUROSCIENCES, 2025, 36 (02) : 119 - 138
  • [26] Comparison of Major LiDAR Data-Driven Feature Extraction Methods for Autonomous Vehicles
    Fernandes, Duarte
    Nevoa, Rafael
    Silva, Antonio
    Simoes, Claudia
    Monteiro, Joao
    Novais, Paulo
    Melo, Pedro
    TRENDS AND INNOVATIONS IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 2, 2020, 1160 : 574 - 583
  • [27] Comparison of data-driven prediction methods for comprehensive coke ratio of blast furnace
    Zhai, Xiuyun
    Chen, Mingtong
    HIGH TEMPERATURE MATERIALS AND PROCESSES, 2023, 42 (01)
  • [28] A Comparison of Data-driven Methods for Patient Motion Estimation in Cardiac SPECT Imaging
    Mukherjee, Joyeeta Mitra
    Dey, Joyoni
    Konik, Arda
    Hutton, Brian F.
    King, Michael A.
    2012 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE RECORD (NSS/MIC), 2012, : 3468 - 3472
  • [29] A Data-Driven Analysis of Robust Automatic Piano Transcription
    Edwards, Drew
    Dixon, Simon
    Benetos, Emmanouil
    Maezawa, Akira
    Kusaka, Yuta
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 681 - 685
  • [30] Optimization of data-driven filterbank for automatic speaker verification
    Sarangi, Susanta
    Sahidullah, Md
    Saha, Goutam
    DIGITAL SIGNAL PROCESSING, 2020, 104