A Free/Open-Source Morphological Analyser and Generator for Sakha

Cited by: 0
Authors
Ivanova, Sardana [1 ,2 ]
Washington, Jonathan N. [1 ,2 ]
Tyers, Francis M. [1 ,2 ]
Affiliations
[1] Helsingin Yliopisto, Helsinki 00014, Finland
[2] Indiana Univ, Swarthmore Coll, Swarthmore, PA 19081 USA
Keywords
morphology; Sakha; Turkic languages; FSTs; finite-state morphology; marginalised languages; EDUCATION;
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications];
Subject classification code
081203 ; 0835 ;
Abstract
We present what is, to our knowledge, the first published morphological analyser and generator for Sakha, a marginalised language of Siberia. The transducer, developed using HFST, has coverage solidly above 90% and high precision. In developing the analyser, we have expanded linguistic knowledge about Sakha and developed strategies for handling complex grammatical patterns. The transducer is already being used in downstream tasks, including computer-assisted language learning applications for linguistic maintenance and computational linguistics shared tasks.
Pages: 5137-5142
Number of pages: 6