Apertium-fin-eng-Rule-based shallow machine translation for WMT 2019 shared task

被引:0
|
作者
Pirinen, Tommi A. [1 ]
机构
[1] Univ Hamburg, Hamburger Zentrum Sprachkorpora, Hamburg, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper I describe a rule-based, bidirectional machine translation system for the Finnish-English language pair. The original system is based on the existing data of FinnWordNet, omorfi and apertium-eng. I have built the disambiguation, lexical selection and translation rules by hand. The dictionaries and rules have been developed based on the shared task data. I describe in this article the use of the shared task data as a kind of a test-driven development workflow in RBMT development and show that it suits perfectly to a modern software engineering continuous integration workflow of RBMT and yields big increases to BLEU scores with minimal effort. The system described in the article is mainly developed during shared tasks.
引用
收藏
页码:335 / 341
页数:7
相关论文
共 12 条
  • [1] PROMT Systems for WMT 2019 Shared Translation Task
    Molchanov, Alexander
    [J]. FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), 2019, : 302 - 307
  • [2] UDS-DFKI Submission to the WMT2019 Similar Language Translation Shared Task
    Pal, Santanu
    Zampieri, Marcos
    van Genabith, Josef
    [J]. FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), VOL 3: SHARED TASK PAPERS, DAY 2, 2019, : 219 - 223
  • [3] JUMT at WMT2019 News Translation Task: A Hybrid approach to Machine Translation for Lithuanian to English
    Mahata, Sainik Kumar
    Garain, Avishek
    Rayala, Adityar
    Das, Dipankar
    Bandyopadhyay, Sivaji
    [J]. FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), 2019, : 283 - 286
  • [4] Findings of the WMT 2019 Biomedical Translation Shared Task: Evaluation for MEDLINE Abstracts and Biomedical Terminologies
    Bawden, Rachel
    Cohen, K. Bretonnel
    Grozea, Cristian
    Yepes, Antonio Jimeno
    Kittner, Madeleine
    Krallinger, Martin
    Mah, Nancy
    Neveol, Aurelie
    Neves, Mariana
    Soares, Felipe
    Siu, Amy
    Verspoor, Karin
    Navarro, Maika Vicente
    [J]. FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), VOL 3: SHARED TASK PAPERS, DAY 2, 2019, : 29 - 53
  • [5] JU-Saarland Submission in the WMT2019 English-Gujarati Translation Shared Task
    Mondal, Riktim
    Nayek, Shankha Raj
    Chowdhury, Aditya
    Pal, Santanu
    Naskar, Sudip Kumar
    van Genabith, Josef
    [J]. FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), 2019, : 308 - 313
  • [6] Apertium: a free/open-source platform for rule-based machine translation
    Forcada, Mikel L.
    Ginesti-Rosell, Mireia
    Nordfalk, Jacob
    O'Regan, Jim
    Ortiz-Rojas, Sergio
    Antonio Perez-Ortiz, Juan
    Sanchez-Martinez, Felipe
    Ramirez-Sanchez, Gema
    Tyers, Francis M.
    [J]. MACHINE TRANSLATION, 2011, 25 (02) : 127 - 144
  • [7] Sharing resources between free/open-source rule-based machine translation systems: Grammatical Framework and Apertium
    Detrez, Gregoire
    Sanchez-Cartagena, Victor M.
    Ranta, Aarne
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 4394 - 4400
  • [8] Recent advances in Apertium, a free/open-source rule-based machine translation platform for low-resource languages
    Khanna, Tanmai
    Washington, Jonathan N.
    Tyers, Francis M.
    Bayatli, Sevilay
    Swanson, Daniel G.
    Pirinen, Tommi A.
    Tang, Irene
    Font, Hector Alos i
    [J]. MACHINE TRANSLATION, 2021, 35 (04) : 475 - 502
  • [9] A Rule-based Shallow-transfer Machine Translation System for Scots and English
    Abercrombie, Gavin
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 578 - 584
  • [10] Robust Machine Translation with Domain Sensitive Pseudo-Sources: Baidu-OSU WMT19 MT Robustness Shared Task System Report
    Zheng, Renjie
    Liu, Hairong
    Ma, Mingbo
    Zheng, Baigong
    Huang, Liang
    [J]. FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), 2019, : 559 - 564