Maximum Entropy Model of Synonym Selection in Post-editing Machine Translation into Kazakh Language

Cited by: 0
Authors
Shormakova, Assem [1]
Tukeyev, Ualsher [1]
Affiliations
[1] Al Farabi Kazakh Natl Univ, Alma Ata, Kazakhstan
Keywords
Maximum entropy model; synonym selection; post-editing; machine translation; Kazakh
DOI
10.1007/978-3-031-70259-4_9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work presents a model, an algorithm, and experimental studies for selecting synonyms to replace words incorrectly translated from English into Kazakh, within the framework of machine translation post-editing. A maximum entropy model was developed for synonym selection. Its distinctive feature is that it considers context words located at any distance from the translated word in the sentence (non-consecutive collocations), which accounts for the peculiarities of the Kazakh language. The proposed solution algorithm for this model uses the semantic cube model proposed by the authors. The maximum entropy model was trained on a parallel Kazakh-English corpus of 250 0000 sentences and evaluated on a test set of 25,000 sentences. In experiments, post-editing of machine translation into Kazakh improved the BLEU score by 6 points compared with the Google Translate output.
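As a rough illustration of the kind of model the abstract describes, the sketch below trains a small maximum entropy (softmax) classifier that picks one synonym from a candidate list using context words found anywhere in the sentence. It is not the authors' implementation: the feature design (one indicator per context-word and candidate pair, ignoring distance), the training loop, the function names, and the placeholder tokens such as `ctx_river` and `syn_shore` are all assumptions made for this example, and the semantic cube model is not represented.

```python
import math
from collections import defaultdict

# Hypothetical candidate synonyms for one mistranslated word (placeholder tokens).
CANDIDATES = ["syn_shore", "syn_finance"]


def features(context_words, candidate):
    """One indicator feature per (context word, candidate) pair.

    Word position is ignored, so a context word counts regardless of its
    distance from the translated word (a non-consecutive collocation).
    """
    return [(word, candidate) for word in set(context_words)]


def probabilities(weights, context_words, candidates):
    """Maximum entropy distribution: p(c | context) is proportional to exp(sum of weights)."""
    scores = [sum(weights[f] for f in features(context_words, c)) for c in candidates]
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def train(examples, candidates, epochs=50, learning_rate=0.5):
    """Stochastic gradient ascent on the conditional log-likelihood."""
    weights = defaultdict(float)
    for _ in range(epochs):
        for context_words, gold in examples:
            probs = probabilities(weights, context_words, candidates)
            for candidate, prob in zip(candidates, probs):
                target = 1.0 if candidate == gold else 0.0
                # Gradient for an indicator feature: observed minus expected count.
                for f in features(context_words, candidate):
                    weights[f] += learning_rate * (target - prob)
    return weights


def select(weights, context_words, candidates):
    """Return the candidate synonym with the highest model probability."""
    probs = probabilities(weights, context_words, candidates)
    return max(zip(candidates, probs), key=lambda pair: pair[1])[0]


# Toy training data: (context words in the sentence, correct synonym).
examples = [
    (["ctx_river", "ctx_walk", "ctx_long"], "syn_shore"),
    (["ctx_money", "ctx_open", "ctx_account"], "syn_finance"),
    (["ctx_boat", "ctx_river"], "syn_shore"),
    (["ctx_account", "ctx_loan"], "syn_finance"),
]

weights = train(examples, CANDIDATES)
chosen = select(weights, ["ctx_loan", "ctx_today", "ctx_money"], CANDIDATES)
print(chosen)
```

In a post-editing setting, a selector like this would only be applied to words already flagged as mistranslated, with the candidate list drawn from a synonym resource. How the authors flag words and build candidate lists is not stated in the abstract.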
Pages: 111-123
Page count: 13