Maximum Entropy Model of Synonym Selection in Post-editing Machine Translation into Kazakh Language

被引:0
|
作者
Shormakova, Assem [1 ]
Tukeyev, Ualsher [1 ]
机构
[1] Al Farabi Kazakh Natl Univ, Alma Ata, Kazakhstan
关键词
Maximum entropy model; synonym selection; post-editing; machine translation; Kazakh;
D O I
10.1007/978-3-031-70259-4_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The work presents a model, algorithm, and experimental studies for selecting synonyms of incorrectly translated words from English into Kazakh within the framework of machine translation post-editing technology. As a model for choosing synonyms, a maximum entropy model has been developed, the distinctive feature of which is the consideration of contextual words located at any distance from the translated word in a sentence (non-consecutive collocations), which takes into account the peculiarities of the Kazakh language. A feature of the proposed solution algorithm for this model is the use of the semantic cube model proposed by the authors. The developed maximum entropy model was learned on the 250 0000 sentences parallel Kazakh-English corpus, and a test set containing 25,000 sentences was conducted. Experiments on post-editing of machine translation of the Kazakh language compared with machine translation of Google Translate showed an improvement in the BLEU metric by 6 positions.
引用
收藏
页码:111 / 123
页数:13
相关论文
共 50 条
  • [41] Machine Translation and Post-editing in AVT: A Pilot Study on a Rising Practice
    Arnaiz-Uzquiza, Veronica
    Gonzalez, Paula Igareda
    TRANS-REVISTA DE TRADUCTOLOGIA, 2023, (27): : 197 - 214
  • [42] Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation
    Liyanapathirana, Jeevanthi
    Popescu-Belis, Andrei
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 2232 - 2239
  • [43] Repairing texts: Empirical investigations of machine translation post-editing processes
    Kittredge, R
    META, 2002, 47 (03) : 435 - 436
  • [44] Machine Translation and Post-editing: Impact of Training and Directionality on Quality and Productivity
    Toledo Baez, M. Cristina
    TRADUMATICA-TRADUCCIO I TECNOLOGIES DE LA INFORMACIO I LA COMUNICACIO, 2018, (16): : 24 - 34
  • [45] MMPE: A Multi-Modal Interface for Post-Editing Machine Translation
    Herbig, Nico
    Duewel, Tim
    Pal, Santanu
    Meladaki, Kalliopi
    Monshizadeh, Mahsa
    Krueger, Antonio
    van Genabith, Josef
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1691 - 1702
  • [46] Machine translation: key concepts, evaluation processes and post-editing techniques
    Ferragud, Maria Ferragud
    TRANS-REVISTA TRANSCULTURAL DE MUSICA, 2022, 26 : 394 - +
  • [47] AuthorsDecisions in projects using machine translation and post-editing - an interview study
    Nitzke, Jean
    Canfora, Carmen
    Hansen-Schirra, Silvia
    Kapnas, Dimitrios
    JOURNAL OF SPECIALISED TRANSLATION, 2024, (41): : 127 - 148
  • [48] Online adaptation strategies for statistical machine translation in post-editing scenarios
    Martinez-Gomez, Pascual
    Sanchis-Trilles, German
    Casacuberta, Francisco
    PATTERN RECOGNITION, 2012, 45 (09) : 3193 - 3203
  • [49] MACHINE TRANSLATION AND POST-EDITING IN WILDLIFE DOCUMENTARIES: CHALLENGES AND POSSIBLE SOLUTIONS
    Ortiz-Boix, Carla
    HERMENEUS, 2016, (18): : 269 - 313
  • [50] The impact of Google Neural Machine Translation on Post-editing by student translators
    Yamada, Masaru
    JOURNAL OF SPECIALISED TRANSLATION, 2019, (31): : 87 - 106